Understanding Space in Proof Complexity: 
Separations and Trade-offs via Substitutions 



Eli Ben-Sasson* 
Computer Science Department 
Technion — Israel Institute of Technology 
Haifa, 32000, Israel 

eli@cs . technion .ac.il 

Jakob Nordstrom^ 
Computer Science and Artificial Intelligence Laboratory 
Massachusetts Institute of Technology 
Cambridge, MA 02 1 39, US A 

jakobn@mit . edu 

August 12, 2010 



Abstract 

For current state-of-the-art satisfiability algorithms based on the DPLL procedure and clause learn- 
ing, the two main bottlenecks are the amounts of time and memory used. In the field of proof complexity, 
these resources correspond to the length and space of resolution proofs for formulas in conjunctive nor- 
mal form (CNF). There has been a long line of research investigating these proof complexity measures, 
but while strong results have been established for length, our understanding of space and how it relates 
to length has remained quite poor. In particular, the question whether resolution proofs can be optimized 
for length and space simultaneously, or whether there are trade-offs between these two measures, has 
remained essentially open apart from a few results in restricted settings. 

In this paper, we remedy this situation by proving a host of length-space trade-off results for resolu- 
tion in a completely general setting. Our collection of trade-offs cover almost the whole range of values 
for the space complexity of formulas, and most of the trade-offs are superpolynomial or even exponential 
and essentially tight. Using similar techniques, we show that these trade-offs in fact extend (albeit with 
worse parameters) to the exponentially stronger fc-DNF resolution proof systems, which operate with 
formulas in disjunctive normal form with terms of bounded arity k. We also answer the open question 
whether the fc-DNF resolution systems form a strict hierarchy with respect to space in the affirmative. 

Our key technical contribution is the following, somewhat surprising, theorem: Any CNF formula 
F can be transformed by simple variable substitution into a new formula F 1 such that if F has the right 
properties, F' can be proven in essentially the same length as F, whereas on the other hand the minimal 
number of lines one needs to keep in memory simultaneously in any proof of F' is lower-bounded by 
the minimal number of variables needed simultaneously in any proof of F, Applying this theorem to 
so-called pebbling formulas defined in terms of pebble games on directed acyclic graphs, we obtain our 
results. 
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1 Introduction 

A central theme in the field of propositional proof complexity is the study of limitations of natural proof 
systems. Such a study is typically conducted by considering a complexity measure of propositional proofs 
and investigating under which circumstances this measure is large. The most thoroughly examined com- 
plexity measure is that of proof size/length. The interest in this measure is motivated by its connections to 
the NP vs. CO-NP problem (since by I CR79I . proving superpolynomial lower bounds for arbitrary proof 
systems would separate NP and CO-NP), by methods for proving independence results in first order theories 
of bounded arithmetic (for an example, see [ Ajt88 ]), and because lower bounds on proof length imply lower 
bounds on the running time of algorithms for solving NP-complete problems such as SATISFIABILITY (such 
algorithms are usually referred to as SAT solvers). 



Proof space This paper focuses on a more recently suggested complexity measure known as space. 
The space measure was first defined and studied by Esteban and Toran [ET01| in the context of the fa- 
mous resolution proof system introduced by Blake [Bla37|, and was generalized to other proof systems by 
Alekhnovich et al. in [ABRW02|. Roughly speaking, the space of a proof corresponds to the minimal size of 
a blackboard needed to give a self-contained presentation of the proof, where the correctness of each step is 
verifiable from what is currently on the blackboard. The interest in space complexity stems from two main 
sources that we survey next. 

First, there are intricate and often surprising connections between space and length. For resolution, it 
follows from the elegant results of Atserias and Dalmau [AD08| that upper bounds on space imply upper 
bounds on length. Esteban and Toran [ET01 1 showed the converse that length upper bounds imply space 
upper bounds for the restricted case of tree-like resolution. Recall that the tree-like version of a sequential 
proof system has the added constraint that every line in the proof can be used at most once to derive a sub- 
sequent line. In terms of space, a proof is tree-like if any claim appearing on the blackboard must be erased 
immediately after it has been used to derive a new claim. Another question which has attracted interest is 
whether space and length can display trade-offs, that is whether there are formulas having proofs in both 
short length and small space, but for which there are no proofs in short length and small space simulta- 
neously. Such length-space trade-offs have been established in restricted settings in IBen09l lNor0 9b P but 



nothing has been known for refutations of explicit formulas in general, unrestricted resolution. 

A second motivation to study space is its connection to the memory consumption of SAT solvers. For 
instance, the family of backtracking heuristics suggested by IDP601 IDLL62 I and known as Davis-Putnam- 
Logemann-Loveland (DPLL) SAT solvers have the following property. When given as input an unsatisfiable 
formula F in conjunctive normal form — called henceforth a CNF formula — the description of the execution 
of a DPLL SAT solver corresponds to a tree-like resolution proof refuting F. Thus, lower bounds on tree-like 
refutation space imply lower bounds on the memory consumption of DPLL SAT solvers, much like lower 
bounds on tree-like refutation length imply lower bounds on the running time of DPLL heuristics. During 
the last 10-15 years, a family of SAT solvers known as DPLL with clause learning [SS96 BS97] (denoted 
DPLL+) has been put to practical use with impressive success. For instance, an overwhelming majority of 
the best algorithms in recent rounds of the international SAT competitions (see [SAT|) belong to this class. 
These SAT solvers have the property that an execution trace corresponds to a (non-tree-like) resolution 
refutation. Hence, space lower bounds in general resolution translate into memory lower bounds for these 
algorithms, and length-space trade-offs could have implications for trade-offs between time efficiency and 
memory consumption. 



1 A proof system is said to be sequential if a proof ty in the system is a sequence of lines 7r = {Li , . . . , L T } where each line is 
derived from previous lines by one of a finite set of allowed inference rules (See Section|3|for formal definitions). 
2 A related result, claimed in | HP07 1, has been later retracted by the authors in | HP 10 1. 
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We end this discussion by pointing out that there is still much left to explore regarding the connection 
between space lower bounds in proof complexity and memory consumption of SAT solvers. On the one 
hand, the memory consumption of a "typical" DPLL+ SAT solver can be far greater than the theoretical 
upper bounds on refutation space. On the other hand, the theoretical lower bounds on refutation space are 
worst-case bounds for non-deterministic algorithms, and hence apply even to the most memory-efficient 
proofs theoretically possible, which is not remotely close to the kind of proofs produced by a typical SAT 
solver. Understanding what kind of practical implications one can get on the memory consumption of SAT 
solvers from refutation space lower bounds remains an interesting open problem. 

/c-DNF resolution The family of sequential proof systems known as k-DNF resolution was introduced 
by Krajfcek [KraOl | as a intermediate step between resolution and depth-2 Frege. Roughly speaking, for 
integers k > the kth member of this family, denoted henceforth by 9\(k), is only allowed to reason in terms 
of formulas in disjunctive normal form (DNF formulas) with the added restriction that any conjunction in 
any formula is over at most k literals. For k = 1, the lines in the proof are hence disjunctions of literals, and 
the system !SK(1) = is standard resolution. At the other extreme, 9l(oo) is equivalent to depth-2 Frege. 

The original motivation to study this family of proof systems, as stated in [KraOl |, was to better un- 
derstand the complexity of counting in weak models of bounded arithmetic, and it was later observed that 
these systems are also related to SAT solvers that reason using multi- valued logic (see [JN02| for a discus- 
sion of this point). By now a number of works have shown superpolynomial lower bounds on the length of 
9t(fc) -refutations, most notably for (various formulations of) the pigeonhole principle and for random CNF 
formulas |AB04 ( ABE02 1 lAletBl IJN021 lRaz03l ISBI041 |Seg05[ . Of particular relevance to our current work 
are the results of Segerlind et al. [SBI04| and of Segerlind [Seg05 1 showing that the family of 9l(&;)-systems 
form a strict hierarchy with respect to proof length. More precisely, they prove that there exists a constant 
e > such that for every integer k > there exists a family of formulas {F n }^ =1 of arbitrarily large size 
n such that F n has a fR(k + l)-refutation of polynomial length but all £K(£;)-refutations of F n require 
exponential length 2 n \ 

Just as in the case for standard resolution, the understanding of space complexity in A;-DNF resolution 
has remained more limited. We are aware of only one prior work by Esteban et al. [EGM04| shedding light 
on this question. Their paper establishes essentially optimal linear space lower bounds for 9\(k) and also 
prove that the family of tree-like 9i(k) systems form a strict hierarchy with respect to space. What they show 
is that there exist arbitrarily large formulas F n of size n that can be refuted in tree-like 9t(A; + 1) in constant 
space but require space $7(n/log 2 n) to be refuted in tree-like *R(k). It should be pointed out, however, 
that as observed in IK ra01lEGM 04l the family of tree-like JH(fe) systems for all k > are strictly weaker 
than standard resolution. As was noted above, the family of general, unrestricted SH(fe) systems are strictly 
stronger than resolution, so the results in [EGM04 1 leave completely open the question of whether there is a 
strict space hierarchy for (non-tree-like) 9\(k) or not. 

Definition Of length and space As a final point before turning to our results, we briefly and informally 
recall what is meant by "length" and "space" (formal definitions will follow in Section|3ll. We view a refuta- 
tion of an unsatisfiable CNF formula F as being presented on a blackboard. The refutation is represented as 
a sequence of sets of A;-DNF formulas it = {Do> • • • ? D T }, where B> t is a snapshot of the blackboard at time t. 
In particular, Do should be the empty set, D r should contain the contradictory empty formula, and at time t 
we can go from from Dt_i to B) t by ( i) writing a clause of F (an axiom) on the blackboard, ( ii) erasing a line 
from the board, or ( Hi) inferring a new line from those lines present on the board according to the inference 
rules of A;-DNF resolution. We do not discuss the details of these rules here, since the exact definitions in 
fact do not matter — the lower bounds we prove hold for any arbitrarily strong (but sound) rules. What is 
important is that the only new formulas that can be derived are those implied by the set of formulas that are 
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currently on the blackboard, and that these formulas are all fc-DNFs. 

The length of a refutation is the number of formulas appealing in the refutation counted with repetitions, 
or equivalently (within a factor of 2) the number of derivation steps. There are several different ways to 
measure the space of a set D t in our refutation. The crudest way is to count the number of A;-DNF formulas 
on the board, i.e., to measure the size of Of. We call this the formula space, or simply, space of O*. (For 
resolution, i.e., when k = 1, this is the well-studied measure of clause space.) A finer granulation is to 
measure the total space — the number of appearances of literals in D t , counted with repetitions. Formula 
space and total space are the two space measures that have received the most attention in previous research, 
and they are also the focus of the current paper. A third, closely related, measure that will also be of interest 
to us is variable space, defined as the number of distinct variables appearing on the board. It is easily seen 
that variable space is a lower bound on total space. For all of these space measures, the space of a refutation 
7r = {Do, . . . ,D T } is the maximal space of any fc-DNF set H) t in it. 

1.1 Our Results in Brief 

Space Hierarchy for A;-DNF resolution Our first main result is that Krajicek's family of /c-DNF res- 
olution proof systems form a strict hierarchy with respect to space. More precisely, we separate fc-DNF 
resolution from (k + 1)-DNF resolution in the sense that we exhibit for every k a family of explicitly con- 
structing CNF formulas of size n that can be refuted in constant formula space and linear length simul- 
taneously in (k + 1)-DNF resolution (i.e., they are very easy with respect to both measures) but have the 
property that any /c-DNF resolution, no matter how long or short, must by necessity use at least order of 



Length-space trade-offs Our second main result is a collection of strong length-space trade-offs for 
fc-DNF resolution. For k = 1, i.e., standard resolution, these are the first trade-off results for resolution 
refutations of explicit formulas in the general, unrestricted resolution proof system, thus eliminating all 
technical restrictions in the previous works [Ben09 ( Nor09b|. For k > 1, to the best of our knowledge no 
trade-offs have been known even in restricted settings. 

We also want to emphasize two other novel aspects of our results. First, as was discussed above there are 
several different ways of measuring space, and previous papers have focused on one particular measure and 
proven results specifically for that measure. Our techniques, however, allow us to obtain results that hold for 
both total space and formula space (i.e., the largest and smallest space measure) simultaneously. Second, 
our upper bounds hold for the standard syntactic version of the proof systems, where new formulas can be 
derived from two existing formulas by a limited set of structural rules, whereas the lower bounds hold for 
semantic versions where new formulas can be derived from an unlimited set of formulas by arbitrary sound 
rules. The reason this is worth pointing out is that in general, semantic fc-DNF resolution proof systems are 
known to be exponentially stronger than syntactic fc-DNF resolution. 

We give the formal statements of our trade-offs in Section |2j but a general template for the kind of 
trade-off theorems we are able to prove is as follows. 

Theorem 1.1 (Trade-off theorem template (informal)). Let K be a fixed positive integer and let s\ a (n) 
and Shi( n ) be suitable function such that s\ (n) <C Shi( n ) = 0(n/loglogn). Then there are explicitly 
constructible CNF formulas {Fn}^=i of size O(n) and width 0(1) (with constants depending on K) such 
that the following holds: 

• The formulas F n are refutable in syntactic resolution in (small) total space 0(s\ o (n)). 

3 A family of formulas is explicitly constructible if there exists a polynomial time algorithm that on input l n produces the nth 
member of the family. 




formula space. 
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• There are also syntactic resolution refutations TT n of F n in simultaneous length 0(n) and (much 
larger) total space 0(shi(n)). 

• However, any resolution refutation, even semantic, in formula space o(s^(n)) must have superpoly- 
nomial or sometimes even exponential length. 

• Even for the much stronger semantic k-DNF resolution proof systems, k < K, it holds that any 
9\(k)-refutation of F n in formula space o( Shi( n )) must have superpolynomial length (or expo- 
nential length, correspondingly). 

We instantiate this theorem template for a wide range of space functions s\ (n) and Shi{n) from constant 
space all the way up to nearly linear space. This is in contrast to INor09bl . where the trade-off results are 
obtained only for a very carefully selected ratio of space to formula size. Moreover, our trade-offs are robust 
in that they are not sensitive to small variations in either length or space (as in [Nor09b |). That is, intuitively 
speaking they will not show up only for a SAT solver being unlucky and picking just the wrong threshold 
when trying to hold down the memory consumption. Instead, any refutation having length or space in the 
same general vicinity will be subject to the same qualitative trade-off behaviour. 

1.2 Overview of Technical Contributions 

We want to highlight three technical contributions underpinning the results discussed informally above. 

Substitution space theorems Our first key technical contribution is a general way to derive strong 
space lower bounds in resolution from weak lower bounds on the number of variables that occur simultane- 
ously in a proof. Very loosely, we show the following: Suppose we have a formula that has short refutations 
but where any such short refutation must mention many variables at some point. Then by making variable 
substitutions in this formula and expanding the result into a CNF formula in the natural way, this new for- 
mula will still have short refutations, but now any such refutation must use lots of space, in the sense that 
lower bounds on variable space will translate into lower bounds on formula space. 

We believe that this generic procedure of transforming weak space lower bounds into stronger ones is 
an interesting result in its own right that sheds new light on space measures in proof complexity. To support 
this point, we strengthen the theorem by showing that not only can we obtain strong resolution lower bounds 
in this way, but it is also possible to lift weak resolution lower bounds to strong lower bounds in other more 
powerful proof systems, namely fc-DNF resolution systems. We remark that this general idea of "hardness 
amplification" in proof complexity has also been used in the recent work of Beame et al. [BHP09 1, although 
the actual techniques there appear somewhat orthogonal to ours (and, in particular, incomparable in the 
sense that it seems neither paper can be used to derive the results in the other). 

Minimally unsatisfiable A;-DNF sets One crucial ingredient in the proof of the substitution space 
theorem for resolution is analyzing the structure of sets of disjunctive clauses that imply many other clauses. 
Intuitively, it seems reasonable that if the set of implied clauses is sufficiently large and disjoint, the set of 
clauses implying all these clauses cannot itself be too small. One important special case of this is for clause 
sets containing many variables but being minimally unsatifiahle — that is, every clause places a necessary 
constraint on the variables to enforce unsatifiability and if just one arbitrary clause is removed from the set, 
then the rest can be satisfied. It is well known that such a clause set must contain strictly less variables than 
clauses, and we can use similar proof techniques to derive the more general result that we need. 

When we want to extend our theorem to /c-DNF resolution, it becomes essential to understand instead 
the structure of sets of A;-DNF formulas that imply many other A;-DNF formulas. Here there are no previous 
results to build on, as the proof techniques that yield tight results for disjunctive clauses can be shown to 
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break down fundamentally. Instead, we have to develop new methods. One important step along the way 
is to understand the structure of minimally unsatisfiable sets of /c-DNF formulas, which appears to be a 
natural combinatorial problems of independent interest. We prove that a minimally unsatisfiable /c-DNF set 
of size m can contain at most m k+l variables, and this bound turns out to be tight up to an additive one 
in the exponent in view of recent joint work [ NR09 1 of Razborov and the second author. 

Reductions between resolution and pebbling Using the substitution space theorems, we can con- 
struct reductions between (/c-DNF) resolution on the one hand and so-called pebble games played on di- 
rected acyclic graphs (DAGs) on the other. In one direction, this reduction is easy, but the other direction 
is nontrivial. Moreover, our reductions are time- and space-preserving. This allows us (modulo some tech- 
nical complications which we ignore for the moment) to translate known trade-off results for pebbling into 
corresponding trade-offs for resolution. This is done by transforming the pebble game played on a DAG G 
into a CNF formula that encodes this particular problem instance of the game, and showing that this formula 
has similar trade-off properties in resolution as the DAG G has for the pebble game. 

With hindsight, such a correspondence might seem more or less obvious, so let us stress that this is not 
the case. Pebble games on graphs and 9^(A;)-refutations of CNF formulas are very different objects. Once we 
have translated a pebbling instance into a CNF formula, it is not at all clear why a lR(/c)-prover refuting this 
formula would have to care about how it was constructed. There might be shortcuts in the proof complexity 
world that do not correspond to anything meaningful in the pebbling world. And indeed, reading previous 
literature on pebbling formulas in proof complexity reveals a few such surprising shortcuts, and there has 
been no consensus on what properties these formulas are likely to have in general. 

What we show is that for the right flavour of pebbling formulas, any prover refuting such formulas must 
in effect reason in terms of pebblings. More precisely, we show that given any 9t(k)-refutation, no matter 
how it is structured, we can extract from it a pebbling of the underlying DAG, and this pebbling has at least 
as good time and space properties as the refutation from which it was extracted. In other words, the pebbling 
formula inherits the time-space trade-off properties of the DAG in terms of which it is defined. This allows 
us to draw on the rich literature on pebbling trade-offs from the 1970s and 1980s, as well as on newer results 
by the second author in I N or 1 Obi , to obtain strong trade-offs in proof complexity. 

1.3 Organization of the Rest of This Paper 

In Section |2j we present formal statements of our main results and — in order not to let all the notation 
and terminology obscure what is in essence a clean and simple proof construction — briefly outline some of 
the key ingredients in the proofs. Having presented our results and sketched the main ideas in the proof 
constructions, in Section |3] we then give the necessary preliminaries for the formal proofs that will follow. 
Our first main technical contribution, the substitution space theorem for resolution, is proven in Section |4] 
In Section |5j we extend this theorem to /c-DNF resolution, along the way establishing upper and lower 
bounds on the size of minimally unsatisfiable sets of /c-DNF formulas. Our final technical tool, namely 
the method for converting strong pebbling trade-offs into length-space trade-offs for resolution, is described 
in Section |6] We apply this tool in Section to prove our collection of length-space trade-off results. 
Concluding the paper, in Section[8]we briefly discuss some open questions. 

2 Formal Statements of Results and Outline of Proofs 

In what follows, let us write L(tt) to denote the length of a resolution refutation and Sp(tv), TotSp{ir), and 
VarSp{iv) to denote the formula space, total space and variable space, respectively. Taking the minimum 
over all resolution refutations of F, we let Lr^(F h 0) denote the length of a shortest refutation, and 
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Sptft(F h 0), TotSptft(F h 0), and VanSp^F h 0) are defined completely analogously. These definitions 
are also generalized to for general k. To state our results it will also convenient to use the notation 
W(tt) for the width of a standard resolution refutation, i.e., the size of a largest clause in it, and Wsr(F h 0) 
for the minimal width of any standard resolution refutation of F. Again, more formal definitions are given 
in Section |3] 

2.1 Substitution Space Theorems 

If F is a CNF formula over variables x, y, z, . . . and / : {0, l} d \-> {0, 1} is a Boolean function over 
d variables, we can obtain a new CNF formula by substituting f{x%, . . . , Xd) for every variable x and then 
expand to conjunctive normal form. We will write F\f\ to denote the resulting substitution formula. For 
example, for the disjunctive clause C = x\/y and the exclusive or function / = x\ © x 2 we have 

C[f} = (x 1 Vx 2 Vy 1 Vy 2 )A(x 1 Vx 2 \/y 1 Vy 2 ) 
A (xi V x 2 V yx V y 2 ) A (xi Vx 2 V^V y 2 ) . 

We say that / is k-non-authoritarian if no partial assignment to any subset of k variables can fix the value 
of / to true or false and that / is k-authoritarian otherwise. For instance, the XOR function © on d + 1 
variables is <i-non-authoritarian, as is the majority function on 2d + 1 variables. If / is 1-non-authoritarian 
(1-authoritarian) we say that the function is simply non-authoritarian {authoritarian). For example, non- 
exclusive or V of any arity is always authoritarian. 

Loosely put, the substitution space theorem for resolution says that if a CNF formula F can be refuted 
in resolution in small length and width simultaneously, then so can the substitution formula F[fd\. There is 
an analogous result in the other direction as well in the sense that we can translate any refutation ttj of F[fd] 
into a refutation ir of the original formula F where the length of it is almost upper-bounded by the length 
of 7Tf (this will be made precise below). So far this is nothing very unexpected, but what is more interesting 
is that if fd is non-authoritarian, then the clause space of ttj is an upper bound on the number of variables 
mentioned simultaneously in ir. Thus, the theorem says that we can convert (weak) lower bounds on variable 
space into (strong) lower bounds on clause space by making substitutions using non-authoritarian functions. 

Theorem 2.1 (Substitution space theorem for resolution). Let F be any unsatisfiable CNF formula and 
fd be any non-constant Boolean function of arity d. Then F[fd] can be refuted in resolution in width 

Wn[F[f d ] h0) =0(d- %(Fh0)) , 

length 

Lm(F[f d ] h 0) < mm {L(ir) • exp(0(d • W(ir)))} , 

and total space 

TotS Pm (F[f d ] h 0) < min { TotSp(ir) • exp(0(d • W{tt)))} . 

7r:_FhO 

In the other direction, any semantic resolution refutation ttj : F[fd] h of the substitution formula can be 
transformed into a syntactic resolution refutation ir : F h of the original formula such that the number 
of axiom download^ in ir is at most the number of axiom downloads in ttj. If in addition fd is non- 
authoritarian, it holds that Sp(jTf) > VarSp{ir), i.e., the clause space of refuting the substitution formula 
F[fd] is lower-bounded by the variable space of refuting the original formula F. 

4 It would have been nice if the bound in terms of number of axiom downloads could have be strengthened to a bound in terms 
of length, but this is not true. The reason for this is that the proof refuting F[fd] is allowed to use any arbitrarily strong semantic 
inference rules, and this can lead to exponential savings compared to syntactic resolution. To see this, just let F be an encoding of, 
say, the pigeonhole principle and let 7T/ be the refutation that downloads all axioms of F[fd] and then derives contradiction in one 
step. Luckily enough, though, the bound in terms of axiom downloads turns out to be exactly what we need for our applications. 
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Note that if F is refutable simultaneously in linear length and constant width, then the bound in Theo- 
remOon L(F[f d ] h 0) becomes linear in L(F h 0). 

The substitution space theorem for fc-DNF resolution extends Theorem 12. II by telling us that for fc-non- 
authoritarian functions /, we can translate back and forth between standard resolution refutations of F and 
%K{k) -refutations of the substitution formula F[f] in a (reasonably) length- and space-preserving way. When 
the "proof blackboard" contains /c-DNFs instead of disjunctive clauses, the analysis becomes much more 
challenging, however, and the bounds we are able to obtain become correspondingly worse. Below, we 
state the theorem with asymptotic factors hidden by the asymptotic notation to make it easier to parse. The 
complete version is given in Section|5] 

Theorem 2.2 (Substitution space theorem for £>DNF resolution). Let F be any unsatisfiable c-CNF for- 
mula and fd be any non-constant Boolean function of arity d, and suppose furthermore that c, d, and k are 
universal constants. Then the following two properties hold for the substitution formula F[fd]: 

l.IfF can be refuted in syntactic standard resolution in length L and total space s simultaneously, then 
F[fd] can be refuted in syntactic SH(ci) in length O(L) and total space O(s) simultaneously. 

2- If fd i s k-non-authoritarian and F[fd] can be refuted by a semantic 9K(k)- refutation that requires for- 
mula space s and makes L axiom downloads, then F can be refuted by a syntactic standard resolution 
refutation that requires variable space at most 0(s fc+1 ) and makes at most L axiom downloads. 

The proofs of Theorems 12.11 and 12.21 are inspired by our recent work [ BN08 ] and indeed our main 
theorem there can be seen to follow from Theorem 12. II Let us discuss the new aspects of the more general 
theorems presented in this paper. First and foremost, our results extend to %K(k) for k > 1 whereas the 
previous theorem applies only to resolution. Second, our previous statement only hold for a very special 
kind of formulas (namely the pebbling formulas discussed above) whereas Theorems 12.11 and 12.21 can be 
used to convert any CNF formula requiring large variable space into a new and closely related CNF formula 
requiring large formula space. Third, in this paper we get length-preserving as well as space-preserving 
reductions, whereas it was unclear how to derive similar reductions from our previous work. And length- 
preserving reductions are crucial for our length-space trade-offs described below. 

We will return to these theorems and sketch the main ingredients in the proofs in Sections [2.41 and 1231 
but before that we want to describe why these tools will be so useful for us. We do so next. 

2.2 Translating Resolution Refutations to Pebblings 

The pebble game played on a DAG G models the calculation described by G, where the source vertices 
contain the inputs and non-source vertices specify operations on the values of the predecessors. Placing a 
pebble on a vertex v corresponds to storing in memory the partial result of the calculation described by the 
subgraph rooted at v. Removing a pebble from v corresponds to deleting the partial result of v from memory. 
Black pebbles correspond to deterministic computation and white pebbles to nondeterministic guesses. A 
pebbling V of G is a sequence of moves starting with the graph being completely empty and ending with all 
vertices empty except for a black pebble on the (unique) sink vertex. The time of a pebbling is the number 
of pebbling moves and the space is the maximal number of pebbles needed at any point during the pebbling. 

The pebble game on the graph G can be encoded as an unsatisfiable CNF formula Peb G saying that the 
sources of G are true and that truth propagates through the graph in accordance with the pebbling rules, but 
that the sink is false. Given any black-only pebbling V of G, we can mimic this pebbling in a resolution 
refutation of Peb G by deriving that a literal v is true whenever the corresponding vertex in G is pebbled 
(this was perhaps first observed in [BiW04]). 
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Lemma 2.3 (| BIW04 1). Let G be a DAG with unique sink and bounded vertex indegree. Then given any 
complete black pebbling V of G, we can construct a standard resolution refutation ir : Peb G h such that 
L(vr) = 0(time(V)), W(ir) = 0(1), and TotSp(ir) = 0(space{V)). 

In the other direction, we start with the result of the first author [Ben09| that if we take any refutation 
of a pebbling contradiction and let positive and negative literals correspond to black and white pebbles 
respectively, then we get (essentially) a legal black-white pebbling of the underlying DAG. That is not quite 
what we need, however, since it only provides a weak bound in terms of variable space. 

This is where Theorems 12. ll and l2.2l come into play. If we make substitutions in Peb G with suitably non- 
authoritarian functions, the upper bounds in Lemma 1231 remain true (with adjustments in constant factors), 
while the lower bounds are lifted from variable space to formula space. For simplicity, we only state the 
lower bounds in the special case for standard resolution below. 

Theorem 2.4. Let f be any non-authoritarian Boolean function and G be any DAG with unique sink and 
bounded indegree. Then from any standard resolution refutation ir : Peb G [f] hO we can extract a black- 
white pebbling strategy V^for G such that tlme^P^) = 0(L(~k)) and space^^) = 0(Sp(n)). 

2.3 Space Separations and Length-Space Trade-offs 

Combining the theorems in Section |2~T1 with the reductions between resolution and pebble games in Sec- 
tion 12.21 we can now establish our space separation and length-space trade-off results. Let us start by 
formally stating the space hierarchy theorem for £K(k). 

Theorem 2.5 (fc-DNF resolution space hierarchy). For every k > 1 there exists an explicitly constructible 
family {F n }^ =1 of CNF formulas of size 0(n) and width 0(1) such that 

• there are D\(k + 1) -refutations i: n : F n h in simultaneous length L(ir n ) = 0(n) and formula space 
Sp(ir n ) = 0(1), but 

• any 9\(k) -refutation of F n requires formula space fi( k+ y/ n / log n) . 
The constants hidden by the asymptotic notation depend only on k. 

The families {F n }^ =1 are obtained by considering pebbling formulas defined in terms of the graphs in 
IGT781 requiring pebbling space 0(n/ logn), and substituting a fc-non-authoritarian Boolean function / of 
arity k + 1, for instance XOR over k + 1 variables, in these formulas. 

Moving on to our length-space trade-offs, in the remainder of this section we try to highlight some of 
the results that we find to be the most interesting. A fuller and more detailed account of our collection of 
trade-off results is given in Section We reiterate that all of our results are for explicitly constructible 
formulas, and that in addition most of the constructions are actually very clean and transparent in that they 
are obtainable from pebbling formulas over simple families of DAGs. 

From the point of view of space complexity, the easiest formulas are those refutable in constant total 
space, i.e., formulas having so simple a structure that there are resolution refutations where we never need to 
keep more than a constant number of symbols on the proof blackboard. A priori, it is not even clear whether 
we should expect that any trade-off phenomena could occur for such formulas, but it turns out that there are 
quadratic length-space trade-offs. 

Theorem 2.6 (Quadratic trade-offs for constant space). For any fixed positive integer K there are explic- 
itly constructible CNF formulas {Fn}^^ of size Q(n) and width 0(1) such that the following holds (where 
all multiplicative constants hidden in the asymptotic notation depend only on K): 

• The formulas F n are refutable in syntactic resolution in total space TotSp<^(F n h 0) = 0(1). 
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• For any Shi(«) = 0(^/nj there are syntactic resolution refutations ir n of F n in simultaneous length 
L{^n) = 0((n/s m (n)) 2 ) and total space TotSp(n n ) = 0(st,i(n)). 

• For any semantic resolution refutation n n : F n \~0 in formula space (i.e., clause space) Sp(ir n ) < 
s hi{ n ) it holds that L(ir n ) = $7((ra/,s ft i(n)) 2 ). 

• For any k < K, any semantic k-DNF resolution refutation n n of F n in formula space Sp(TT n ) < 
Shi(n) must have length L(ir n ) = f2 (jip/{ s u{ n ) 1 ^ k+1 ^)) 2 \ In particular, any constant-space 
D\(k) -refutation must also have quadratic length. 

Theorem 12.61 follows by combining our machinery with the seminal work on pebbling trade-offs by 
Lengauer and Tarjan [LT82| and the structural results on simulations of black- white pebblings by resolution 
in INorlObl . 

Remark 2.7. Notice that the trade-off applies to both formula space and total space. This is because the 
upper bound is stated in terms of the larger of these two measures (total space) while the lower bound is 
in terms of the smaller one (formula space). Note also that the upper bounds hold for the usual, syntactic 
versions of the proof systems, whereas the lower bounds hold for the much stronger semantic systems, and 
that for standard resolution the upper and lower bounds are tight up to constant factors. These properties of 
our results are inherited from the substitution space theorems, and they hold for all our trade-offs stated here 
and in Section Finally, we remark that we have to pick some arbitrary but fixed limit K for the size of the 
terms when stating the results for A;-DNF resolution, since for any family of formulas we consider there will 
be very length- and space-efficient 9\(k) -refutation refutations if we allow terms of unbounded size. 

Our next result relies on a new pebbling trade-off result in [Nor 10b |, building on earlier work by Carlson 
and Savage [CS80 CS82|. Using this new result, we can derive among other things the rather striking 
statement that for any arbitrarily slowly growing non-constant function, there are explicit formulas of such 
(arbitrarily small) space complexity that nevertheless exhibit superpolynomial length-space trade-offs. 

Theorem 2.8 (Superpolynomial trade-offs for arbitrarily slowly growing space). Let s\ (n) = w(l) be 
any arbitrarily slowly growing function^ and fix any e > and positive integer K. Then there are explicitly 
constructible CNF formulas {F n }^ =1 of size Q(n) and width 0(1) such that the following holds: 

• The formulas F n are refutable in syntactic resolution in total space TotSp^(F n h 0) = 0(s\ (n)). 

• There are syntactic resolution refutations 7r n of F n in simultaneous length L(n n ) = O(n) and total 

) 2 ) 1/3 )- 

• Any semantic resolution refutation of F n in clause space O ^ (n/s\ (n) 2 ^ ^ S must have superpoly- 
nomial length. 

• For any k < K, any semantic 9\(k) -refutation of F n in formula space O ^(ra/si (n) 2 ) 1 ^ 3( ' fc+1 ^ e 
must have superpolynomial length. 

All multiplicative constants hidden in the asymptotic notation depend only on K, e and S\ Q . 



space TotSp(-7r n ) = O I (n/s\ (n 



5 For technical reasons, let us also assume here that s\ (n) = Oiji 1 ^ 7 ^, i.e., that si Q (n) does not grow too quickly. This 
restriction is inconsequential since for such fast-growing si Q (n) other trade-off results presented below will yield much stronger 
bounds. 
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Observe the robust nature of this trade-off, which is displayed by the long range of space complexity 
in standard resolution, from w(l) up to « n 1 / 3 , which requires superpolynomial length. Note also that the 
trade-off result for standard resolution is very nearly tight in the sense that the superpolynomial lower bound 
on length in terms of space reaches up to very close to where the linear upper bound kicks in. 

The two theorems above focus on trade-offs for formulas of low space complexity, and the lower bounds 
on length obtained in the trade-offs are somewhat weak — the superpolynomial growth in Theorem 12.81 is 
something like n Sl °( n \ We next present a theorem that has both a stronger superpolynomial length lower 
bounds than Theorem 12.81 and an even more robust trade-off covering a wider (although non-overlapping) 
space interval. This theorem again follows by applying our tools to the pebbling trade-offs in [LT82]. 

Theorem 2.9 (Robust superpolynomial trade-off for medium-range space). For any positive integer K, 
there are explicitly constructible CNF formulas {F n }^ =1 of size 0(n) and width 0(1) such that the follow- 
ing holds (where the hidden constants depend only on K): 

• The formulas F n are refutable in syntactic resolution in total space TotSp<^(F n h 0) = 0(log 2 n). 

• There are syntactic resolution refutations of F n in length 0(n) and total space 0(n/ log n). 

• Any semantic resolution refutation of F n in clause space Sp(ir n ) = o(n/logn) must have length 

L(TT n ) = ra n(loglogrO 

• For any k < K, any semantic fR(k) -refutation informula space 5p(7T n ) = o ((n/ log n) 1 /( fc+1 ') must 
have length L(ir n ) = n n ( lo s lo s n ). 

Having presented trade-off results in the low-space and medium-space range, we conclude by presenting 
a result at the other end of the space spectrum. Namely, appealing one last time to yet another result 
in [LT82], we can show that there are formulas of nearly linear space complexity (recall that any formula is 
refutable in linear formula space) that exhibit not only superpolynomial but even exponential trade-offs. 

We state this final theorem only for standard resolution since it is not clear whether it makes sense 
for 9\(k). That is, we can certainly derive formal trade-off bounds in terms of the (k + l)st square root 
as in the theorems above, but we do not know whether there actually exist 9l(fc)-refutation in sufficiently 
small space so that the trade-offs apply. Hence, such trade-off claims for 9\(k), although impressive looking, 
might simply be vacuous. We can obtain other exponential trade-offs for SR(fc) (see Section0for the details), 
but they are not quite as strong as the result below for resolution. 

Theorem 2.10 (Exponential trade-offs for nearly-linear space). Let n be any sufficiently large constant. 
Then there are CNF formulas F n of size 0(n) and width 0(1) and a constant k' <C k such that: 

• The formulas F n have syntactic resolution refutations in total space n' ■ n/ logn. 

• F n is also refutable in syntactic resolution in length 0(n) and total space 0(n) simultaneously. 

• However, any semantic refutation of F n in clause space at most k ■ nj log n has length exp(n f2 ^ 1 ^). 

To get a feeling for this last trade-off result, note again that the lower bound holds for proof systems with 
arbitrarily strong derivation rules, as long as they operate with disjunctive clauses. In particular, it holds for 
proof systems that can in one step derive anything that is semantically implied by the current content of the 
blackboard. Recall that such a proof system can refute any unsatisfiable CNF formula F with n clauses in 
length n + 1 simply by writing down all clauses of F on the blackboard and then concluding, in one single 
derivation step, the contradictory empty clause implied by F. In Theorem l2. lOl this proof system has space 
nearly sufficient for such an ultra-short refutation of the whole formula. But even so, when we feed this 
proof system the formulas F n and restrict it to having at most 0(n/ log n) clauses on the blackboard at any 
one given time, it will have to keep going for an exponential number of steps before it is finished. 
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2.4 Proof Ingredients for Substitution Space Theorem for Resolution 

Before embarking on the formal proofs of our theorems, we want to provide some intuition for the substi- 
tution space theorems that are the keys to our results. Let us first focus on the result for standard resolution 
and describe the proof structure in some detail. The analogous result for fc-DNF resolution is proven in a 
similar way, but with the added technical complications that we need to prove size bounds on sets of fc-DNF 
formulas. These issues, and in particular our result for minimally unsatisfiable sets of fc-DNF formulas, are 
discussed in Section l2~5l 

Thus, let F be any unsatisfiable CNFformla and fd any non-authoritarian Boolean function (as described 
in Section l2!Tt . and let F[fd] denote the CNF formula obtained by substituting f(x\, . . . ,xj) for every 
variable x in F and expanding the result to conjunctive normal form. 

The first part of Theorem 12.11 that any resolution refutation of F can be transformed into a refutation 
of F[fd] with similar parameters, is not hard to prove. Essentially, whenever the refutation of the original 
formula F writes a clause C on the blackboard, we write the corresponding set of clauses C[fd] on the 
blackboard where we are refuting the substitution formula. We make the additional observation that if we 
take any resolution refutation 7r of F and write down the new refutation wf of C[fd] resulting from this 
transformation — assuming for concreteness that the function is exclusive or, say — it is easy to verify that 
the number of variable occurrences in it, i.e., the variable space, translates into a lower bound on the number 
of clauses in -kj, i.e., the formula space (which as we recall is called clause space for standard resolution). 
Equation (I2.lt provides an example of this variable-space-to-clause-space blow-up. 

It is more challenging, however, to prove the reverse direction that we can get lower bounds on clause 
space for F[fd] from lower bounds on variable space for F. Ideally, we would like to claim that any prover 
refuting F[f d ] had better write down to the blackboard clause sets on the form C[fd] corresponding to 
clauses C in some refutation of the original CNF formula F, and that if he or she does not, then we can 
analyze the refutation as if that is what is happening anyway, just ignoring the clauses that do not fit into this 
framework. 

To argue this more formally, we need to specify how sets of clauses in a refutation of F[fd] should 
be translated to clauses in a purported refutation of F. We do this by devising a way of "projecting" any 
refutation of F[fd] down on a refutation of F. These "projections" are defined in terms of a special kind of 
"precise implication" which we describe next. Recall that for Boolean functions F and G, we say that F 
implies G, denoted F 1= G, if any truth value assignment satisfying F must also satify G. 

Definition 2.11 (Precise implication and projected clauses (informal)). Suppose that D is a set of clauses 
over variables in Vars(F[fd}) and that P and N are (disjoint) subset of variables of F. If any truth value 
assignment satisfying D must also satisfy Vxgp/d(^) V V^eiV ^fdiv) Dut this is not the case for strict 
subsets P' ^ p or N' ^ N, we say that the clause set O implies \J xeP fd(%) v V^gtv "'fdiv) precisely. 

Let us write any clause C as C = C + V C - , where C + = VxeL«t(C) x * s tne disjunction of the positive 
literals in C and C~ = Vye£#(C) V * s tne disjunction of the negative literals. Then we say that D projects C 
if D implies \J x&c + fdffl v Vygc- "'fdiv) precisely, and we write proj F {p) to denote the set of all clauses 
that D projects on F. 

Given this definition, we want to take any refutation ttj = {Do,Bi, . . . ,D T } of F[fd] and argue that 
7r = {proj F (Do), proj p{p>\), . . . , proj F (D T )} is (essentially) the refutation of F that we are looking for. 

It is not hard to see that for a "well-behaved" prover refuting F[fd] using a refutation it of the original 
formula F as a template but substituting the clause set C[fd] for every clause C appearing on the blackboard, 
applying the projection in Definition 12. Ill at every step in the derivation will give us back the refutation tt 
of F that we started with (the reader can check that this is the case for instance for the clause set in (I2.lt ). 
What is more remarkable is that this projection of refutations of F[fd] always works no matter what the 
prover is doing, in the sense that the result is always a resolution refutation of the original formula F and 
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this projected refutation does not only (essentially) preserve length, which is not too complicated to show, 
but also space. We refer to Section|4]for the formal statements and proofs. 

2.5 Substitution Space Theorem for 9t(fc) and Minimally Unsatisfiable /-DNF Sets 

The proof of the first part of Theorem 12. 21 is again reasonably straightforward and resembles our proof of 
the substitution theorem for the standard resolution proof system. For the second part, however, we require 
a result, described next, that bounds the number of variables appearing in a minimally unsatisfiable fc-DNF 
set of a given size. Since this result addresses a combinatorial problem that appears to be interesting (and 
challenging) in its own right, we describe it in some detail below. 

We start by recalling that a set of 1-DNF formulas, i.e., a CNF formula, is said to be minimally unsat- 
isfiable if it is unsatisfiable but every proper subset of its clauses is satisfiable, and try to generalize this 
definition to the case of k > 1. Perhaps the first, naive, idea how to extend this notion is to define D to be 
minimally unsatisfiable if it is unsatisfiable but all proper subsets of it are satisfiable. This will not work, 
however, and the set of formulas 

{x, ((x A yi) V (x A y 2 ) V (x A y 3 ) V • • • V (x A y n j) } (2.2) 

shows why this approach is problematic. The set (12.2b . which consists of two 2-DNF formulas, is unsatis- 
fiable but every proper subset of it is satisfiable. However, the number of variables appearing in the set can 
be arbitrarily large so there is no way of bounding | Vars (Dj | as a function of |D|. 

A more natural requirement is to demand minimality not only at the formula level but also at the term 
level, saying that not only do all DNF formulas in the set have to be there but also that no term in any 
formula can be shrunk to a smaller, weaker term without the set becoming satisfiable. Luckily enough, this 
also turns out to be the concept we need for our applications. The formal definition follows next. 

Definition 2.12 (Minimal implication and minimally unsatisfiable fc-DNF sets). Let D be a set of /c-DNF 
formulas and let G be a formula. We say that D minimally implies G if D N G and furthermore, replacing 
any single term T appearing in a single DNF formula with a proper subterm of T, and calling the 

resulting DNF set W, results in W \f G. If G is unsatisfiable we say D is minimally unsatisfiable. 

To see that this definition generalizes the notion of a minimally unsatisfiable CNF formula, notice that 
removing a clause C' from a CNF formula F is equivalent to replacing a term of C' , which is a single 
literal, with a proper subterm of it, which is the empty term. This is because the empty term evaluates to 1 
on all assignments, which means that the resulting clause also evaluates to 1 on all assignments and hence 
can be removed from F. With this definition in hand, we are thus interested in understanding the following 
problem: 

Given a minimally unsatisfiable set of m k-DNF formulas, what is an upper bound on the 
number of variables that this set of formulas can contain? 

As was noted above, for k = 1 the set D is equivalent to a CNF formula, because it is a set of disjunctions 
of literals, and we have the following "folklore" result which seems to have been proved independently on 
several different occasions (see IAL86UBET0T1lCS88IIKu"l00l \ 

Theorem 2.13. IfB is a minimally unsatisfiable CNF formula, then \ Vars(D)\ < |D|. 

Theorem l2.13l has a relatively elementary proof based on Hall's marriage theorem, but its importance to 
obtaining lower bounds on resolution length and space is hard to overemphasize. For instance, the seminal 
lower bound on refutation length of random CNFs given by Chvatal and Szemeredi in [CS88 1 makes crucial 
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use of it, as does the proof of the "size- width trade-off" of [BW01 1. Examples of applications of this theorem 
in resolution space lower bounds include [ABRW02 BG031 lBN08l INor()9al iNHUSll . 

For sets of A;-DNF formulas with k > 1, we are not aware of any upper or lower bounds on mini- 
mally unsatisfiable sets prior to our work. The main technical result that we need in order to establish the 
A;-DNF resolution space hierarchy is the following extension of Theorem l2.13l to the case of k > 1. 

Theorem 2.14. Suppose that D is a minimally unsatisfiable k-DNF set. Then the number of variables in D 
is at most | Vars(B)\ < (k • |B|) fc+1 . 

Proof sketch. Let us sketch the proof for k = 2. (The full proof is given in Section|5]) Suppose that we have 
a 2-DNF set D with m formulas mentioning ri(m 3 ) variables. Then there is at least one 2-DNF formula D* 
mentioning f2(m 2 ) variables. By the definition of minimality, the set O \ {D} is satisfiable. Let a be some 
minimal partial assignment fixing D \ {D} to true, and note that a needs to set at most 2(m — 1) variables 
(at most one 2-term per formula). 

Consider the 2-terms in D*. If there are 2m terms over completely disjoint pairs of variables, then there 
is some 2-term a A b untouched by a. If so, a can be extended to a satisfying assignment for all of D, which 
is a contradiction. Hence there are at most O(m) terms over disjoint sets of variables. 

But D* contains S7(m 2 ) variables. By counting (and adjusting the implicit constant factors), there must 
exist some literal a* in D* occuring in a lot of terms (a* A b\) V (a* A b 2 ) V • • • (a* A &2m)- Again by 
minimality, there is a (partial) truth value assignment a' satisfying D \ {D} and setting a* to true. (To see 
this, note that shrinking, for instance, a* A b\ to a* should make the whole set satisfiable). But if we pick 
such an a 1 of minimal size, there must exist some 6j that is not falsified and we can extend a 1 to a satisfying 
assignment for a* A bi and hence for the whole set. Contradiction. □ 

We want to point out that in contrast to Theorem l2.131 which is exactly tight (consider the set 

{ViLi^i) -^i, ^x 2 , ^x n ) (2.3) 

of n + 1 clauses over n variables), there is no matching lower bound on the number of variables in Theo- 
rem 12.141 The best explicit construction that we were able to obtain, stated next, has number of variables 
only linear in the number of A;-DNF formulas (for k constant), improving only by a factor k 2 over the bound 
for CNF formulas in Theorem l2.13l 

Lemma 2.15. There are minimally unsatisfiable k-DNF sets D with \ Vars (D)| > fc 2 (|D| — 1). 

Proof sketch. Consider any minimally unsatisfiable CNF formula consisting of n+1 clauses over n variables 
(for instance, the one in (I2.3I) ). Substitute every variable Xi with 

(xj A xf A • • • A x^) V (x* +1 A x^ +2 A • • • A x 2k ) V • • • V (xf ~ fc+1 A xf ~ k+2 A • • • A xf) (2.4) 

and expand every clause to a /c-DNF formula. Note that this is possible since the negation of MAI that we 
need to substitute for -<Xi can also be expressed as a /c-DNF formula 

\J (-xf A • • • A -xf ) . (2.5) 

(jl,... ) j fc )€[l,fe]X...X[(fc2-fc+l,fc 2 ] 

It is straightforward to verify that the result is a minimally unsatisfiable /c-DNF set in the sense of Defini- 
tion l2. 121 and this set has n + 1 formulas over k 2 n variables. □ 

In our first preliminary report [ BN09 1 on our results for fc-DNF resolution, we stated that we saw no 
particular reason to believe that the upper bound in Theorem l2. 14l should be tight, hinting that Lemma l2.15l 
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might well be closer to the truth. Surprisingly to us, this turned out to be wrong. In a joint work [NR09] 
with Razborov, the second author recently showed that there are minimally unsatisfiable /c-DNF sets with 
m formulas and « m k variables, which means that Theorem 12. 141 is tight up to an additive one in the 
exponent. 

Concluding this section, we remark that the precise statement required to prove the second part of 
Theorem 12. 21 is somewhat more involved than Theorem 12.141 However, the two proofs follow each other 
very closely. We refer to Section [5] for the details. 

3 Preliminaries 

In this section we give the basic formal definitions used in this paper and state a few well-known facts that 
we will need. 

3.1 Formulas, Assignments and Restrictions 

For the most part we use standard notation for formulas in conjunctive normal form (CNF) and disjunctive 
normal form (DNF). However, we will also use the very convenient, although perhaps slightly less standard, 
set notation to treat objects such as clauses, terms, restrictions and CNF and DNF formulas. We explain this 
notation and terminology next. Throughout this paper, we let [n] denote the set {1,2,..., n}. 

DNF and CNF formulas as sets For x a Boolean variable, a literal over x is either a Boolean variable 
x, called a positive literal over x, or its negation, denoted —>x or x and called a negative literal over x. 
Sometimes it will also be convenient to write x 1 for unnegated variables and x° for negated ones. We define 
-■-■a; to be x. When x is understood from context or unimportant we simply speak of a (positive, negative) 
literal. A CNF formula is a set of clauses, i.e., disjunctions of literals, and a DNF formula is a set of terms, 
i.e., conjunctions of literals. The variable set of a term T, denoted Vars(T), is the set of Boolean variables 
over which there are literals in T and Lit(T) is the set of literals. The variable and literal sets of a clause 
are similarly defined and these definitions are extended to CNF and DNF formulas by taking unions. If X is 
a set of Boolean variables and Vars(T) Clwe say T is a term over X and similarly define clauses, CNF 
formulas, and DNF formulas over X. 

We will sometimes think of clauses and terms as sets of literals and borrow set-theoretic notation and 
terminology to discuss logical formulas. For instance, we say that the term T" is a subterm of T, and write 
T' C T to denote that the set of literals of T' is contained in the set of literals of T. We similarly speak of, 
and denote, subclauses and subformulas. We say the clause C (or term T) is a k-clause (k-term, respectively) 
if \C\ < k (\T\ < k, respectively). A k-DNF formula D is a set of £;-terms and a k-CNF formula is a 
set of /c-clauses. We define the size S(F) of any formula F to be the total number of literals in F counted 
with repetitions. More often, we will be interested in the number of clauses \F\ of a CNF formula F or the 
number of terms in a DNF formula. We write \D\to den ote is the number of terms in a fc-DNF formula and 
correspondingly for the number of clauses in a CNF formula. 

Assignments and restrictions as sets As is the case with CNF and DNF formulas, we prefer to use 
in our proof a set-theoretic representation of restrictions and assignments, as defined next. 

A restriction p over a set of Boolean variables X is a subset of literals over X with the property that for 
each variable x € X there is at most one literal over x in p. The set of variables assigned by p is Vars (p) 
and the size of p is \p\ = \ Vars (p) |. We say the restriction p' extends p if p' D p, and in this case we also 
say that p agrees with p'. An assignment a to X is a restriction satisfying \a\ = \X\. 
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For a a literal over X and p a restriction over X, let the restriction of a under p be 



a| P = < 



1 a G p 

-.a G p (3.1) 
a otherwise 



If a\ p = 1 we say p satisfies a, if aL = we say p falsifies a and otherwise we say p leaves a unfixed. We 
extend the definition of a restriction to a term T = ai A • • • A a s and clause C = a[ V ■ ■ ■ V a' s as follows. 
Let -i/o = {-ia|a G p} denote the restriction obtained by replacing every literal in p by its negation. 



1 T C p 

o , cr, 

T \ p otherwise 



CCn^ 

1 C n p / (3.2) 
C\->p otherwise 



In words, we say T is satisfied by p if p satisfies all literals in T, we say T is falsified by p if some literal of 
p is falsified and otherwise T is unfixed by p. Dually, C is satisfied if some literal of it is satisfied by p, it is 
falsified if all its literals are falsified by p and otherwise it remains unfixed. Notice that the empty term, i.e., 
the term of size 0, is satisfied by every restriction and the empty clause is falsified by all of them. We extend 
the definition of a restriction to a DNF formula D = D\\J ■ ■ ■ V D m = {Di, . . . , D m } by 

1 3ie[m},Di\ p = l 

Di\ p = 0,i€[m] (3.3) 

{Alp : Alp + 0} otherwise, 

and to a CNF formula F = Ci A • • • A C m = {Ci, . . . , C m } by 

f 3iG [m],Qr p = 

^tp = < 1 Ci\ p = l,i€[m] (3.4) 

I {Ci\ p : Ci\ p ^l} otherwise. 

The notions of a restriction satisfying, falsifying and leaving unfixed a DNF or CNF formula are analogous to 
those defined for terms and clauses. If p is a restriction satisfying a formula F, yet every proper subrestriction 
p 1 C p does not satisfy F, then we say p is a minimal satisfying restriction. A minimal falsifying restriction 
is analogously defined. We write p(~^C) to denote the minimal restriction fixing a clause C to false, i.e., 

p(pC) = {a | a e C}. 

When a is a truth value assignment to the variables in a formula F, it will be notationally convenient to 
think of a as assigning a truth value to F. We write a(F) to denote this assigned value, or sometimes F\ a 
when we think of a as a restriction. We will use these two notations interchangeably. We write 

a^(y)= H y) l^ X > (3.5) 

to denote the truth value assignment that agrees with a everywhere except possibly at x, to which it assigns 
the value v. 

A term (clause, respectively) is said to be trivial if it contains both a positive and a negative literal 
over the same variable. We may assume without loss of generality that all terms (clauses, respectively) 
appearing in our paper are nontrivial, because the value of a DNF (CNF, respectively) remains unchanged 
after addition or removal of trivial terms (clauses, respectively). We say that a DNF formula D over X 
represents a Boolean function / : X — > {0, 1} if and only if for all assignments a G {0, 1} X , we have 
f(a) = D(a). The notion of a CNF formula representing / is analogously defined. Clearly, every Boolean 
function / can be represented by a DNF formula (for instance, the disjunction of all terms corresponding to 
satisfying truth value assignments) as well as by a CNF formula (for instance, the conjunction of all clauses 
ruling out falsifying assignments for /). 



15 



UNDERSTANDING SPACE IN PROOF COMPLEXITY 



Implication If C is a set of formulas we say that a restriction (or assignment) satisfies C if and only if it 
satisfies every formula in C. For D, C two sets of formulas over a set of variables X, we say that D implies C, 
denoted D N C, if and only if every assignment a to X that satisfies D also satisfies C. In particular, D 1= 
if and only if D is unsatisfiable, i.e., no assignment satisfies D. 

3.2 /-DNF Resolution 

We now give a more precise description of the fc-DNF resolution proof systems and the proof complexity 
measures for these systems that we are interested in studying. 

Definition 3.1 (fc-DNF-resolution inference rules). The k-D~NF -resolution proof systems are a family of 
sequential proof systems parameterized by k G N + . Lines in a fc-DNF-resolution refutation are /c-DNF for- 
mulas and the following inference rules are allowed (where A,B,C denote /c-DNF formulas, T, T' denote 
/c-terms, and ai, . . . , ojt denote literals): 

k-CUt ("lA...Av)Vfl, v yV...V^VC ) where y < L 

A-introduction a\j\tat') » as ^ on § as \T^T \ < k. 
f\-elimination for any T' C T. 

Weakening A ^ B for any A;-DNF formula B. 

The formulas above the line are called the inference assumptions and the formula below is called the conse- 
quence. For brevity we denote by 9\(k) the proof system of fc-DNF resolution. 

When we want to study length and space simultaneously in resolution, we have to be slightly careful 
with the definitions in order to capture length-space trade-offs. Just listing the clauses used in a resolution 
refutation does not tell us how the refutation was performed, and essentially the same refutation can be 
carried out in vastly different time depending on the space constraints (as is shown in this paper). Following 
the exposition in [ET01 1, therefore, we view a resolution refutation as a Turing machine computation, with 
a special read-only input tape from which the axioms can be downloaded and a working memory where all 
derivation steps are made. Then the length of a proof is essentially the time of the computation and space 
measures memory consumption. The following definition is the straightforward generalization to d\(k) of 
the space-oriented definition of a refutation from [ABRW02]. 

Definition 3.2 (Derivation). A k-DNF configuration D, or, simply, a configuration, is a set of fc-DNF for- 
mulas. A sequence of configurations {Deb • • • > ^r} is said to be a 9\(k)-derivation from a CNF formula F 
if D = and for all i G [t], the set D t is obtained from Bf_i by one of the following derivation steps: 

Axiom Download H>t = EDt-i U {C} for some C G F. 

Inference Dj = Dt-i U {D} for some D inferred by one of the inference rules listed in Definition B. ll from 
a set of assumptions that belongs to ©t_i. 

Erasure D t = D t _i \ {D} for some D G O t _i. 

A 9l(/c)-derivation ir : F h W of a £>DNF set D' from a formula F is a derivation ir = {Do, . . . , D T } such 
that D T = W. A JH(fe) -refutation of F is a 9*(£;)-derivation of the empty DNF (denoted by 0), i.e., the DNF 
formula with no terms, or, phrased differently, the unsatisfiable empty disjunction. 

When the derived A;-DNF set W contains a single formula D, we will often abuse notation slightly by 
writing simply n : F \- D instead of it : F h {D}. 
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Definition 3.3 (Refutation length and space). The formula space, or simply space, of a configuration D 
is its size |D|. The variable spac^of D, denoted VarSp(U>), is the number of literals appearing in B, i.e., 
VarSp(p) = | Vors(D)| and the total space of D, denoted TotSp{p) is the number of literals appearing in 
D counted with repetitions. (Notice that TotSp(B) > VarSp(B).) 

The length of a 9l(&;)-derivation 7r is the number of axiom downloads and inference steps in it0 The 
space (variable space, total space, respectively) of a derivation ir is defined as the maximal space (variable 
space, total space, respectively) of a configuration in tt. If 7r is a derivation of D from a formula F of length 
L and space s then we say D can be derived from F in length L and space s simultaneously. 

We define the Oi(k) -refutation length of a formula F, denoted L^ k ^{F h 0), to be the minimum length 
of any %K(k) -refutation of it. The SH(fc) -refutation space of F, denoted Sp^ k ^(F h 0), the 9t(fc) -refutation 
total space TotSp^t k \{F h 0), and the -refutation variable space of F, denoted VarSp^^) (-F h 0), 
are analogously defined by taking minima over all !tK(/c)-refutations of F. (When the proof system 9i(k) in 
question is clear from context, we will drop the subindex in the proof complexity measures.) 

Notice that the system 9^(1) = is the usual resolution proof system. We remark that in resolution, the 
A-introduction and A-elimination rules do not apply, and the cut rule reduces to the familiar resolution rule 
saying that the clauses C\ V x and C2 V ->x can be combined to derive C\ V C2. The formula space measure 
in resolution is known as clause space. For resolution, it is also convenient to define the auxiliary measure 
of width. 

Definition 3.4 (Resolution width). The width W{C) of a clause C is the number of literals in it and the 
width W(F) of a formula F is the size of a widest clause in F. The width W(tt) of a derivation it is defined 
in the same way, and W(F h 0) denotes the minimum width of any resolution refutation of F. 

Using the same terminology as in [ABRW02|, we also define a stronger version of the fc-DNF resolution 
proof systems. 

Definition 3.5 (Semantic fc-DNF resolution). The semantic k-DNF resolution proof systems are defined 
as in Definitions 13. II and 13.21 but with the modification that the inference rule is that any k-DNF formula 
implied by Oj can be derived at time t + 1. We refer to the systems in Definitions 13.11 and 13 . 21 as syntactic to 
distinguish them from the semantic ones. 

Clearly, semantic systems are much stronger than syntactic ones. In particular, any unsatisfiable CNF 
formula with m clauses has a semantic resolution refutation of length m + 1, since we can just list all 
clauses in the formula and then derive the empty clause in one step (which is implied since the formula is 
unsatisfiable). Note, however, that this refutation also has space m + 1. This is in stark contrast to the fact 
that there are exponential lower bounds known on length for /c-DNF resolution for any k. The reason that 
we distinguish between semantic and syntactic proof systems is that all our upper bounds in the trade-offs 
are in terms of syntactic systems (i.e., the usual ones) whereas the lower bounds in the trade-offs hold even 
for semantic proof systems. 

3.3 Some Structural Results for Resolution 

Although the weakening rule is sometimes convenient for technical reasons, for the case of standard resolu- 
tion it is easy to show that any weakening steps can always be eliminated from a refutation of an unsatifiable 

6 We remark that there is some terminological confusion in the literature here. The term "variable space" has also been used in 
previous papers (including by the current authors) to refer to what is here called "total space." The terminology adopted in this paper 
is due to Hertel and Urquhart (see | Her08 1), and we feel that although this naming convention is as of yet less well-established, it 
feels much more natural than the alternative. 

7 As noted above, the reader who so prefers can instead define the length of a derivation it — {Do, . . . , D r } as the number of 
steps r in it, since the difference is at most a factor of 2. We have chosen the definition above for consistency with previous papers 
defining length as the number of clauses in a listing of the derivation. 
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CNF formula without changing anything essential. Thus, while the results in this paper will be stated for 
resolution with the weakening rule, they also hold for resolution refutations using only axiom downloads, 
resolution rule applications and erasures. Let us highlight this fact in a (somewhat) formal proposition for 
the record. 

Proposition 3.6. Any resolution refutation n : F h using the weakening rule can be transformed into a 
refutation n' : F h without weakening in at most the same length, width, clause space, total space, and 
variable space, and performing at most the same number of axiom downloads, inferences and erasures as n. 

The proof of Proposition l3.6l is an easy forward induction over the resolution refutation (simply ignoring 
all weakening moves and keeping the subclauses instead, which can never increase any measure we are 
interested in). We omit the details. 

Another convenient fact is that restrictions preserve resolution refutations in the following sense. 

Proposition 3.7. If tt is a resolution refutation of F and p is a restriction on Vars (F), then ir \ can be 
transformed into a resolution refutation of F\ p in at most the same length, width, clause space, total space, 
and variable space as it. 

In fact, 7r\ p is a refutation of F\ p (removing all trivially true clauses), but possibly using weakening. 
The proof of this is again an easy induction over n. 

We will also make use of the implicational completeness of resolution. Formally, this means that if C is 
a set of clauses and C is a clause, then C 1= C if and only if there exists a resolution derivation of C from C. 

Proposition 3.8. Suppose C is a set of clauses and C is a clause, both over a set of variables of size n. Then 
C\= C if and only if there exists a resolution derivation of C from C. Furthermore, ifC can be derived from 
C then it can be derived in length at most 2 n+1 — 1 and total space at most n[n + 2) simultaneously. 

Proof sketch. Suppose first that C = is the contradictory empty clause. Build a search tree where all 
vertices on level i query the ith variable and where we go to the left, say, if the variable is false under a given 
truth value assignment a and to the right if the variable is true. As soon as some clause in C is falsified by 
the partial assignment defined by the path to a vertex, we make that vertex into a leaf labelled by that clause. 
This tree has height h < n and hence size at most 2 h+1 — 1, and if we turn it upside down we can obtain a 
legal tree-like refutation (without weakening) of C in this length. This refutation can be carried out in clause 
space h + 2 and total space upper-bounded by the clause space times the number of distinct variables, i.e., 
at most n(n + 2). (We refer to, for instance, [Ben09 ET01 ] for more details.) 

If C ^ 0, apply the unique minimal restriction p falsifying C. Then C\ p 1= C\ p = 0, and we can 
construct a refutation of C L from a search tree of height h < n, since C L contains strictly fewer variables 
than C. Removing the restriction p from this refutation, and adding at most one extra weakening step for 
every other derivation step (this is an example of where the weakening rule comes in handy), we get a 
derivation of C from C. (See [BW01 ] for a formal proof of this fact.) This derivation has length at most 
2 • (2 h+l - 1) < 2" +1 - 1 and total space at most n(h + 2) < n(n + 2). □ 

In a resolution refutation of a formula F, there is nothing in Definition l3.2l that rules out that completely 
unnecessary derivation steps are made on the way, such as axioms being downloaded and then immediately 
erased again, or entire subderivations being made to no use. In our constructions it will be important that we 
can rule out some redundancies and enforce the following requirements for any resolution refutation: 

• Every clause in memory is used in an inference step before being erased. 

• Every clause is erased from memory immediately after having been used for the last time. 
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We say that a resolution refutation that meets these requirements is frugal. The formal definition, which is a 
mildly modified version of that in [Ben09], follows. 

Definition 3.9 (Frugal refutation). Let tt = {Co = 0, Ci, . . . , C T = {0}} be a resolution refutation of 
some CNF formula F. The essential clauses in tt are defined by backward induction: 

• If Ct is the first configuration containing 0, then is essential at time t. 

• If D G Ct is essential and is inferred at time t from C\, C% G Q-i by resolution, then C\ and C 2 are 
essential at time t — 1. 

• If D is essential at time t and D G Q-i, then D is essential at time t — 1. 

Essential clause configurations are defined by forward induction over 7T. The configuration Ci G 7r is 
essential if all clauses D G Q are essential at time t, if Q is obtained by inference of an essential clause 
from a configuration Ct_i containing only essential clauses at time t — 1, or if Ct is obtained from an 
essential configuration Ct_i by an erasure step. Finally, tt = {Co, • • • ,C r } is & frugal refutation if all 
configurations Ci G tt are essential. 

Without loss of generality, we can always assume that resolution refutations are frugal. 

Lemma 3.10. Any resolution refutation tt : F h can be converted into a frugal refutation tt' : F h w/?/z- 
om£ increasing the length, width, clause space, total space, or variable space. Furthermore, the axiom 
downloads, inferences and erasures performed in tt' are a subset of those in tt. 

Proof. The construction of tt' is by backward induction over tt. Set s = mm{t : G Ct} and C' s = {0}. 
Assume that C' s , C' s _ 1 , . . . C' t+2 , C' t+1 have been constructed and consider Ct and the transition Ct Cf+i. 

Axiom Download Q+i = Ct U {C}: Set C£ = C' t+1 \ {C}. (If C is not essential we get C' t = C£ +1 .) 

Erasure Q+i = Ct \ {D}: Ignore, i.e., set C' t = C' t+1 . 

Inference Cf+i = Ct U {D} inferred from Ci,C2 G Ct: If D C t+1 , ignore the step and set C' t = 
C' t+1 . Otherwise (using fractional time steps for notational convenience) insert the configurations 

a t = c t+1 u {c u c 2 } \ {d}, c; +| = c t+1 u {c 1: c 2 }, c t+l = c t+1 u {c 2 }. 

Finally go through tt' and eliminate any consecutive duplicate clause configurations. 

It is straightforward to check that tt' is a legal resolution refutation. Let us verify that tt' is frugal. By 
backward induction, each C' t for integral time steps t contains only essential clauses. By forward induction, 
if C' t+l = C' t U {C} is obtained by axiom download, all clauses in C' t+1 are essential. Erasures in tt are 
ignored. For inference steps, C' t contains only essential clauses by induction, C' - L is essential by inference, 

and C' t+2 and C' t+1 are essential since they are derived by erasure from essential configurations. Finally, it 

is clear that tt' performs a subset of the derivation steps in tt and that the length, width, and space does not 
increase. □ 

3.4 Pebble Games and Pebbling Contradictions 

Pebble games were devised for studying programming languages and compiler construction, but have found 
a variety of applications in computational complexity theory. In connection with resolution, pebble games 
have been employed both to analyze resolution derivations with respect to how much memory they consume 
(using the original definition of space in [ET01 1) and to construct CNF formulas which are hard for differ- 
ent variants of resolution in various respects (see for example [AJPU02, BIW04, BEGJ00 BP03| and the 
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sequence of papers I Nor09al INH08I IBN0 8 1 leading up to this work). An excellent survey of pebbling up to 
ca. 1980 is [Pip80]. We also refer the interested reader to the upcoming survey IN orlOaL which contains 
some later results and also describes connections between pebbling and proof complexity. 

The black pebbling price of a DAG G captures the memory space, i.e., the number of registers, required 
to perform the deterministic computation described by G. The space of a non-deterministic computation is 
measured by the black-white pebbling price of G. We say that vertices of G with indegree are sources and 
that vertices with outdegree are sinks (or targets). In the following, unless otherwise stated we will assume 
that all DAGs under discussion have a unique sink and this sink will always be denoted z. The next definition 
is adapted from [CS76|, though we use the established pebbling terminology introduced by IHP V77I . 

Definition 3.11 (Black-white pebble game). Suppose that G is a DAG with sources S and a unique sink z. 
The black-white pebble game on G is the following one-player game. At any point in the game, there are 
black and white pebbles placed on some vertices of G, at most one pebble per vertex. A pebble configuration 
is a pair of subsets P = (B,W) of V(G), comprising the black-pebbled vertices B and white-pebbled 
vertices W. The rules of the game are as follows: 

1. If all immediate predecessors of an empty vertex v have pebbles on them, a black pebble may be 
placed on v. In particular, a black pebble can always be placed on any vertex in S. 

2. A black pebble may be removed from any vertex at any time. 

3. A white pebble may be placed on any empty vertex at any time. 

4. If all immediate predecessors of a white-pebbled vertex v have pebbles on them, the white pebble on 
v may be removed. In particular, a white pebble can always be removed from a source vertex. 

A black-white pebbling from (Bo, Wo) to (B T , W T ) in G is a sequence of pebble configurations V = 
{P , ■ • • ,P T } such that P = (B , Wo), P r = (B T , W T ), and for all t e [r], F t follows from P t _i by one of 
the rules above. A (complete) pebbling of G, also called a pebbling strategy for G, is a pebbling such that 
(Bo, Wq) = (0, 0) and (B T , W T ) = ({z}, 0). 

The time of a pebbling V = {Po, • • • ,P r } is simply time(V) = r and the space is space(V) = 
maxo<t< T {\Bt U Wt\}. The black-white pebbling price (also known as the pebbling measure or pebbling 
number) of G, denoted BW-Peb(G), is the minimum space of any complete pebbling of G. 

A black pebbling is a pebbling using black pebbles only, i.e., having Wt = for all t. The (black) 
pebbling price of G, denoted Peb(G), is the minimum space of any complete black pebbling of G. 

For any DAG G over n vertices with bounded indegree, the black pebbling price (and thus also the 
black- white pebbling price) is at most 0(n/logn) [HPV77|, where the hidden constant depends on the 
indegree. A number of exact or asymptotically tight bounds on different graph families have been proven in 
the whole range from constant to @(n/ log n), for instance in IGT781 lKla85l ILT801 |Mey 8 1 1 IPTC771 . As to 
time, obviously any DAG G over n vertices can be pebbled in time 2n — 1, and for all graphs we will study 
this is also a lower bound, so studying the time measure in isolation is not that exciting. A very interesting 
question, however, is how time and space are related in a single pebbling of G if one wants to optimize 
both measures simultaneously, and this question turns out to be intimately connected with proof complexity 
trade-offs. Time-space trade-offs for pebble games have been studied in a long sequence of papers, but for 
our applications we will focus on [CS80, CS82 LT82] and the more recent results in [NorlOb]. Our way of 
relating pebbling trade-offs and proof complexity trade-offs goes via the CNF formulas defined next. 

A pebbling contradiction over a DAG G is a CNF formula that encodes the pebble game on G by 
postulating the sources to be true and the target to be false, and specifying that truth propagates through the 
graph according to the pebbling rules. These formulas have previously been studied more or less implicitly 
in IR M991IBEG J00I before being explicitly defined is IBWTJU. 
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Definition 3.12 (Pebbling contradiction). Suppose that G is a DAG with sources S and a unique sink z. 
Identify every vertex v £ V(G) with a prepositional logic variable v. The pebbling contradiction over G, 
denoted Peb G , is the conjunction of the following clauses: 

• for all s 6 S, a unit clause s (source axioms), 

• For all non-source vertices v with immediate predecessors m,...,U£, the clause u\ V • • • V ui V v 
(pebbling axioms), 

• for the sink z, the unit clause z (target or sink axiom). 

If G has n vertices and maximal indegree t, the formula Peb G is an unsatisfiable (l+Q-CNF formula 
with n + 1 clauses over n variables. 



3.5 Substitution Formulas 



Throughout this paper, we will let fd denote any (non-constant) Boolean function fd : {0, l} d i-> {0, 1} 
of arity d. We use the shorthand x = (x±, . . . ,Xd), so that fd{x) is just an equivalent way of writing 
fd{x\, . . . , Xd). Every function fd(xi, • • • , Xd) is equivalent to a CNF formula over x\,. . . ,Xd with at 
most 2 d clauses. Fix a canonical way to represent functions as CNF formulas and let Cl[f d {x)] denote 
the canonical set of clauses representing fd- Similarly, let Cl[-^f d (x)} denote the clauses in the canonical 
representation of the negation of /. 
For instance, we choose to define 



Cl[y 2 {x)\ = {x 1 V x 2 } and C7[-.V 2 (x)] = {x u x 2 } 
for logical or of two variables and 

Cl[® 2 (x)] = {xi V x 2 , x x V x 2 } and Cl[-^® 2 (x)\ = {x x V x 2 , x x V x 2 } 
for logical exclusive or of two variables. The general definitions for exclusive or are 



and 



Cl[® d (x)\ = {VtXlEti^ ^ d (mod 2)} 
Clh(Bd(x)} = {Vti<lEti^ # d (mod 2)} 



(3.6) 



(3.7) 



(3.8) 



(3.9) 



from which we can see that Cl[(B d {x)] and C7[-i©d(x)] both are <i-CNFs. We will also be interested in the 
threshold function saying that k out of d variables are true, which we will denote thr k d . To give an example, 
for thr\ we have 



Cl[thr 2 4 (x) 



and 



Cl[^thr\(x^ 



' x\ V x 2 V x 3 / 

X\ V X 2 V X4, 

xi V 13 V X4, 
I x 2 V x 3 V x 4 J 



^ xi V x 2 ,' 
xi V x 3 , 
xi V X4, 

X 2 V X3, 

x 2 V x 4 , 
L x 3 V x 4 J 



(3.10) 



(3.11) 
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and in general we have 

Cl[thr k d (x)} = {\f ieS Xi\S C [d], \S\=d-k + l} (3.12) 

and 

Cl[^thr k d (x)\ = {\J ieS Xi\S C [d], \S\ = k} . (3.13) 

Clearly, thr\{x\, . . . , Xd) is just another way of writing the function Vf=i x i> anc ^ ^ r< d{ x ii ■ ■ ■ > x d) = 
All a*. 

In general, we could construct a canonical representation CZ[/ rf (x)] for as follows. For a truth value 
assignment a : {x\, . . . , x^} h-> {0, 1} we define the clause C Q = x} ^ V • • • V x\ a ( Xd > that is true for 
all assignments to x\, ■ ■ ■ ,Xd except a. Then we could define 

ciiw)} = A c « ■ < 3 - 14 ) 

a:a(f d (x))=0 

But this way of representing the Boolean function can turn out to be unnecessarily involved. For instance, 
for binary logical and (13.141) yields Cl[f\2(x)] = {x\ V X2, x\ V X2, x\ V x^\ instead of the arguably more 
natural representation C7[Aa(^)] = {^i, x{\- Therefore, we want the freedom to choose our own canonical 
representation when appropriate. Note, however, that (I3.14t constitutes a proof of the fact that without loss 
of generality we can always assume that | CZ[/ rf (x)]| < 2 d , since there are only 2 d truth value assignments 
and fd is assumed to be non-constant. 

The following observation is rather immediate, but nevertheless it might be helpful to state it explicitly. 

Observation 3.13. Suppose for any non-constant Boolean function fd that C G Cl[fd(x)] and that p is any 
partial truth value assignment such that p(C) = 0. Then for all D S Cl[-^f d (x)] it holds that p(D) = 1. 

Proof. If p(C) = this means that p(fd) = 0. Then clearly p{~^fd) = 1> so, in particular, p must fix all 
clauses D G Cl[-^f d {x)] to true. □ 

We want to define formally what it means to substitute fd for the variables Vars (F) in a CNF formu- 
la F. For notational convenience, we assume that F only has variables x, y, z, et cetera, without subscripts, 
so that x\, . . . , Xd, • • • ,ydi z\, . . . , Zd, ■ ■ ■ are new variables not occurring in F. We will say that the 
variables x±, . . . , Xd, and any literals over these variables, all belong to the variable x. 

Definition 3.14 (Substitution formula). For a positive literal x and a non-constant Boolean function fd, we 
define the fd-substitution of x to be x[fa] = Cl[f d {x)], i.e., the canonical representation of fd{xi, • • • , Xd) 
as a CNF formula. For a negative literal ->y, the /^-substitution is -<y{fd\ = Cl[-^f d {y)]- The /^-substitution 
of a clause C = a\ V • • • V is the CNF formula 

C[f d ]= A ••• A (CiV...VC fc ) (3.15) 

CiEttil/d] C k €a k [f d ] 

and the /^-substitution of a CNF formula F is F[fd] = AcgF C[fd)- 

The reader is reminded that an example of this definition was given in Equation ( 12. II ) on page|5| 
We note that F[fd] is a CNF formula over d- \ Vars(F) \ variables containing strictly less than |F|-2 d ' W ( F ) 
clauses. (Recall that we defined a CNF formula as a set of clauses, which means that \F\ is the number of 
clauses in F.) We have the following easy observation, the proof of which is presented for completeness. 

Observation 3.15. For any non-constant Boolean function fd : {0, l} d h-> {0,1}, it holds that F[fd] is 
unsatisfiable if and only if F is unsatisfiable. 
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Proof. Suppose that F is satisfiable and let a be a truth value assignment such that a(F) = 1. Then we can 
satisfy F[fd] by choosing an assignment a' for Vars(F[fM in such a way that fd(a'(x%), . . . , a'(xd)) = 
a(x). For if C G F is satisfied by some literal a, t set to true by a, then a' will satisfy all clauses Cj £ a, [/<J 
and thus also the whole CNF formula C[fj\ in (I3.15t . 

Conversely, suppose F is unsatisfiable and consider any truth value assignment a' for F[fd\. Then a' 
defines a truth value assignment a for F in the natural way by setting a(x) = fd{a'{xi), ■ ■ ■ , &'(xd)), and 
we know that there is some clause C G F that is not satisfied by a. That is, for every literal ai G C = 
a\ V • • • V afc it holds that a(aj) = 0. But then a' does not satisfy ai[fd], so there is some clause C- G a^f/d] 
such that a' (CD = 0. This shows that a' falsifies the disjunction C[ V • • • V C' k G C[/d], and consequently 
must also be unsatisfiable. □ 

For our present purposes, a particularly interesting kind of Boolean functions f(x\, . . . , Xd) are those 
having the property that no subset of k variables can determine the value of f(x%, . . . , Xd)- 

Definition 3.16 (Non-authoritarian function). We say that a Boolean function / over variables X = 
{xi, . . . , Xd} is k-non-authoritaria^ii no restriction to X of size k can fix the value of /. In other words, 
for every restriction p to X with \p\ < k there exist two assignments ao, ot\ D p such that f(ao) = and 
/(«i) = l- 

Observe that a function on d variables can be fc-non-aufhoritarian only if k < d. As noted above, two 
natural examples of d-non-authoritarian functions are exclusive or © of d + 1 variables and majority of 
2d + 1 variables, i.e., thri\\ . 



4 Substitution Space Theorem for Resolution 

We now present the full proof of our first main technical contribution, which describes how the space com- 
plexity of a formula in resolution changes under substitution. Let us start by restating the theorem for 
reference. 

Theorem 12.11 (restated). Let F be any unsatisfiable CNF formula and fd ■ {0, l} d \-t {0, 1} be any non- 
constant Boolean function. Then it holds that the substitution formula F[fd] can be refuted in standard 
syntactic resolution in width 

W(F[f d ]hO) =0(d- ^h0)) , 

length 

Lm{F[f d ] H)< mm {L(K) • exp(0(d- W(tt)))} , 

7r:r rU 

and total space 

TotSp m (F[f d ] h 0) < min { TotSp(ir) ■ exp(0(d • W(n)))} . 

7r:_FhO 

In the other direction, any semantic resolution refutation TTf : F[fd] h of the substitution formula can be 
transformed into a syntactic resolution refutation ir : F h of the original formula such that the number of 
axiom downloads in tt is at most the number of axiom downloads in TTf. If in addition fd is non-authoritarian, 
it holds that Sp(iTf) > VarSp(ir), i.e., the clause space of refuting the substitution formula F[fd] is lower- 
bounded by the variable space of refuting the original formula F. 

8 Although related notions occur in the literature, we have not been able to locate any previous definition that captures what we 
require in Definition l3.16l We therefore introduce a (hopefully evocative) new name that decribes the property that we need. 
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4.1 Two Corollaries of the Substitution Space Theorem 

Before embarking on the proof of the theorem, we want to make a couple of quick remarks. Although this 
might not be immediately obvious, Theorem I2. H is remarkably powerful as a tool for understanding space 
in resolution. It will take some more work before we can present our main applications of this theorem, 
which are the strong time-space trade-off results discussed in Section Let us note for starters, however, 
that without any extra work we immediately get lower bounds on space. 

Esteban and Toran [ET01 1 proved that the clause space of refuting F is upper-bounded by the formula 
size. In the papers IABRW02. IBG03I IET011 it was shown, using quite elaborate arguments, that there 
are polynomial-size /c-CNF formulas with lower bounds on clause space matching this upper bound up to 
constant factors. Using Theorem I2. II we can get a different proof of this fact. 

Corollary 4.1 ( IABKW021 IBG031 lETOD ). There are families of k-CNF formulas {F}™ =1 with 0(n) 
clauses over O(n) variables such that Sp(F n h 0) = 0(n). 

Proof. Just pick any formula family for which it is shown that any refutation of F n must at some point in the 
refutation mention Q(n) variables at the same time (e.g., from [BW01 1), and then apply Theorem 12. II □ 

It should be noted, though, that when we apply Theorem O the formulas in |ABRW02 BG03l lET0T1l 
are changed. We want to point out that there is another, and even more elegant way to derive Corollary 14.11 
from [BW01 1 without changing the formulas, namely by using the lower bound on clause space in terms of 
width in IAD08I . 

For our next corollary, however, there is no other, simpler way known to prove the same result. Instead, 
our proof in this paper actually improves the constants in the result. 

Corollary 4.2 (|BN08|). There are families {i ? n }^L 1 of k-CNF formulas of size O(n) refutable in linear 
length L(F n h 0) = O(n) and constant width W(F n h 0) = 0(1) such that the minimum clause space 
required is Sp(F n h 0) = Q(n/ logn). 

This follows, just as in |BN08), by considering graphs of size 0(n) with maximal pebbling price 
0(n/logn) and studying refutations of pebbling contradictions over such graphs. Since the variable space 
of such refutations is at least the pebbling price, Corollary l4.2l follows immediately from Theorem |^TJ (and, 
as noted, with better constants than in [BN08|), by using, say, exclusive or over two variables in the substi- 
tution formula. 

4.2 Proof of the Substitution Space Theorem — Main Components 

We divide the proof of Theorem 12. II into three parts in Lemmas 14.3114.61 and !4.7l below. In this subsection, 
we state these three theorems and show how they combine to yield Theorem 12. II The rest of Section 0] is 
then spent proving these three auxiliary lemmas. 

Lemma 4.3. For any CNF formula F and any non-constant Boolean function f d , it holds that 

W(F[f d ]\-0) =0(d- W(F\-Q)) , 
Ln{F[f d ] h 0) < mm {L(tt) ■ exp(0(d- W(tt)))} , 

7v:r r 

and 

TotSp m (F[f d } h 0) < min { TotSp(ir) ■ exp(0(d • W(w)))} . 
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These upper bounds for resolution refutations of F[fd] are not hard to show. The proof proceeds along 
the following lines. Given a resolution refutation n of F, we construct a refutation 7rj : F[fd] l~ mimicking 
the derivation steps in it. When it downloads an axiom C, we download the exp(0(d • W(C))) axiom 
clauses in C[fd]. When 7r resolves C\ V x and C2 V x to derive C\ V C2, we use the fact that resolution is 
implicationally complete to derive {C\ V C2)[fd] from (Ci V x)[f d ] and (C2 V x)[fd] in at most exp(0(d • 
W(C\ V C2))) steps. We return to the details of the proof in Section l4~3l 

Before stating the next lemma, we need to give a slightly more formal description than in Definition ^. 1 II 
of how clauses derived from F[fd] are translated into clauses over Vars(F) using projections defined in 
terms of precise implication. 

Definition 4.4 (Precise implication). Let F be a CNF formula and fd a non-constant Boolean function, 
and suppose that D is a set of clauses over Vars(F[fd\) and that P and N are (disjoint) subset of variables 
of F. If 

\/ f d (x)V \/ ^f d {y) (4.1a) 
ieP y&N 

but for all strict subsets P' C p and N' g N it holds that 

®p V ^( f ) v V > ( 4 - lb ) 

xeP' yeN 

and 

V f^)y V -/d(i/) , ( 4 - lc ) 

xeP y&N' 

we say that the clause set D implies \J xeP fd(x) V \J yeN ~<fd(y) precisely and write 

D > \/ f d (x) V \/ ^f d (y) . (4.2) 

x&P y£N 

Note that P = N = in Defmition l4.4l corresponds to D being unsatisfiable. 

Recalling the convention that any clause C can be written C = C + V C~, where C + = \l x& iit{c) x ^ s 
the disjunction of the positive literals in C and C~ = Vyeiit(c) V ^ s tne disjunction of the negative literals, 
and choosing not to write x G Lit (C + ) or y e Lit (C~) below, but instead x € C + and y € for brevity 
(which is still formally correct since a clause is a set of literals), we define projection of clauses as follows. 

Definition 4.5 (Projected clauses). Let F be a CNF formula and fd a non-constant Boolean function, and 
suppose that D is a set of clauses derived from F[fd\. Then we say that O projects the clause C = C + V C~ 
if there is a subset C D such that 

»cO V fd{x)V V ^fdiv) (4-3) 

and we write 

proj F (p) = {C |3D C C D s.t. D c > \J x£C+ f d (x) V Vpec-" 1 /^^} ( 4 - 4 ) 

to denote the set of all clauses that D projects on F. 

Given Definitions 14.41 and 14.51 which tell us how to translate clauses derived from F[fd] into clauses 
over Vars(F), the next step is to show that this translation preserves resolution refutations. 
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Lemma 4.6. Suppose that itj = {Dq> • • • i^t} is a semantic resolution refutation of F[f d ]for some arbi- 
trary unsatisfiable CNF formula F and some arbitrary non-constant function f d . Then the sets of projected 
clauses {proj f (Dq), . . . , pro j ^(D T )| form the "backbone" of a resolution refutation ir of F in the sense 
that: 

• pwj F (D ) = 0. 

• proj F (B T ) = {0}. 

• All transitions from proj F (Bt-i) to proj F (Bt) for t E [r] can be accomplished by axiom downloads 
from F, inferences, erasures, and possibly weakening steps in such a way that the variable space in tt 
during these intermediate derivation steps never exceeds maxe) g7r/ { VarSp(proj ^(D))}. 

• The only time tt performs a download of some axiom C in F is when tt f downloads some axiom 
D G C[f d ] in F[f d }. 

Note that by Proposition 13.61 we can get rid of the weakening moves in a postprocessing step, but 
allowing them in the statement of Lemma l4~6l makes the proof much cleaner. Accepting Lemma l4~6l on faith 
for the moment (deferring the proof to Section l4~4l >. the final missing link in the proof of the Substitution 
Space Theorem is the following lower bound. 

Lemma 4.7. Suppose that D ^ is a set of clauses over Vars(F[f d ]) for some arbitrary CNF formula F 
and some non-authoritarian function f d . Then Sp(D) = |D| > VarSp(proj F (D)). 

Combining Lemmas 14.31 14.61 and 14.71 the substitution space theorem for resolution follows. This is 
immediate, but for the convenience of the reader we write out the details. 

Proof of Theorem 1X71 The first part of Theorem 12.11 i.e., that any refutation tt of F can be converted to a 
refutation ttj of the substitution formula F[f d ], is Lemma l4~3*1 verbatim. For the second part of Theorem l2.ll 
Lemma l4~6l describes how any refutation 7rj of the substitution formula F[f(i\ can be translated back into 
a refutation tt of the original formula F. This is true regardless of what kind of function is used for the 
substitution. If in addition fd is non-authoritarian, Lemma l4~7l savs that the clause space of ttj provides an 
upper bound for the variable space of tt. The theorem follows. □ 

It remains to prove Lemmas 14.31 14.61 and 14.71 For convenience of notation in the proofs, let us define 
the disjunction C V D of two clause sets C and D to be the clause set 

CVD = {CVD|CeC,DGD} . (4.5) 

This notation extends to more than two clause sets in the natural way. Rewriting (I3.15t in Definition 13.141 
using this notation, we have that 

(D V a)[f d ] = D[f d ] V a[f d ] = f\ /\ (Ci V C 2 ) . (4.6) 

Ci£D[f d ]C 2 &a[f d ] 

4.3 Refuting Substitution Formulas F[f d ] by Simulating Refutations of F 

To prove Lemma l4~3l we show how to construct, given tt : F\- 0, a resolution refutation ttj : D[f d ] h by 
maintaining the invariant that if we have C in memory for tt, then we have C[f d ] in memory for TTf and this 
configuration has total space at most X^cec ^ ^ — |C|2 rf ' w( - n \ We get the following case analysis. 

Axiom download If tt downloads C, we download all of C[f d ], i.e., less than 2 d ' w ^ clauses which all 
have width at most d • W(C). 
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Erasure If n erases C, we erase all of C[fd] in less than 2 d ' w ( c ) erasure steps. 

Inference This is the only interesting case. Suppose that it infers C\ V C 2 from C\ V x and C 2 V x. Then 
by induction we have (Ci V x)[fd] and (C 2 V 3f)[/d] in memory in 7r/. It is a straightforward extension 
of Observation I3T51 that if C t= D, then C[f d ] t= !?[/*], so in particular it holds that (d V x)[f d ] and 
(C2 V x) [fd] imply (Ci V C2) [/J. By the implicational completeness of resolution, these clauses can all be 
derived. 

An upper bound (not necessarily tight) for the width of this derivation in ttj is d • (W(C\ V x) + 
^(C 2 VS)+ W(Ci V C 2 )) = 0(d ■ W{tt)) , as claimed. 

To bound the length, note that (C% V C 2 )[/d]. contains less than 2 d ' w ( ClVC ^ clauses. For every clause 
D € {C\ V C 2 )[/d], consider the minimal restriction p{~^D) falsifying D. Since 

(Ci V x) [fd] A (C2 V 2?) [/d] t= -D (4.7) 

we have that 

(Ci V x) [/ d ] \ phD) A (C 2 V 5?) [f d ] \ phD) t= . (4.8) 

The number of variables is at most ci • ( W{C\ V C2) + 1) = N, and by Proposition l3.8l there is a refutation of 
(Ci V x) [f d ] \ phD) A (C 2 V 2c) t p( ^ jD ) in length at most 2 N+1 - 1 and total space at most 2 N+1 (2 N+1 + 
2) = 2°^. Looking at this refutation and removing the restriction p(->D), it is straightforward to verify 
that we get a derivation of D from (Ci V x)[fd] A (C 2 Vi)[/d] in the same length and same asymptotic 
total space (see, for instance, the inductive proof in [BW01]). We can repeat this for every clause D € 
(Ci V C 2 )[fd] to derive all of the less than 2 d ( w ( Cl vC>2)) clauses in this set in total length at most 

2d-(w{c 1 vc 2 )) . 2 <i-(w(CiVC2)+2) < 2 3d - w M = 2°( d - ^W) (4 9) 

Taken together, we see that we get a refutation 7r/ in length at most L(ir) ■ 2°( d ' w ^)) and width at most 
0(d ■ W(ir)) and total space at most TotSp(ir) ■ exp(0(d • W(tt))). Lemma l431 follows. 

4.4 Translating Refutations of Substitution Formulas F[f d ] to Refutations of F 

We next prove Lemma l4~6l Let us use the convention that D and D denote clause sets and clauses derived 
from F[fd] while C and C denote clause sets and clauses derived from F. 

Let us also overload the notation and write D 1= C, D ¥ C, and D > C for C = C+ V C~ when 
the corresponding implications hold or do not hold for D with respect to \j ' x&c + j ? d(x) V V 1 y^c-^fdiv) 
(with precise implication \> defined as in Definition l4.4l) . Note that it will always be clear when we use the 
notation in this overloaded sense since D and C are defined over different sets of variables. 

Recall from Definition Othat proj F {B) = {C \3B C C D s.t. D c > \J x&c+ f d {x) V \J wC -^fd{v)} 
is the set of clauses projected by D. In the spirit of the notational convention just introduced, we will let Cj 
be a shorthand for proj F (Pt). 

Suppose now that ttj = [Bq, . . . , O r } is a resolution refutation of F[fd] for some arbitrary unsatisfiable 
CNF formula F and some arbitrary non-constant function fd. 

The first two bullets in Lemma l4~6l are immediate. For Do = we have Co = proj f (Bq) = 0, and it is 
easy to verify that D T = {0} yields C T = proj F (H> T ) = {0}. As an aside, we note that the empty clause 
will have appeared in Q = proj F (B>t) earlier, namely for the first t such that 3 t is contradictory. 

The tricky part is to show that all transitions from Q_i = proj F Cp t -i) to Q = proj F (H>t) can be 
performed in such a way that the variable space in our refutation under construction it : F V- never exceeds 
max{ VarSp(Ct-i), VarSp(Ct)} during the intermediate derivation steps needed in n. The proof is by 
a case analysis of the derivation steps. Before plunging into the proof, let us make a simple but useful 
observation. 
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Observation 4.8. Using the overloaded notation just introduced, ifD t \= C then C = C + V C is derivable 
from Ct = proj F (Ot) by weakening. 

Proof. Pick Cf C C+ and Cf Q C~ minimal so that D 1= Cf V C 2 ~ still holds. Then by definition 
B t> V C2 so V C2 G Cj and C D V can be derived from Ct by weakening as claimed. □ 

Consider now the rule applied in -Kf at time t to get from Bt_i to B t . We analyze the three possible 
cases — inference, erasure and axiom download — in this order. 

Inference Suppose D t = B t _i U {D}. Since B t -i C D t , it is immediate from Definition |43] that no 
clauses in Q-i can disappear at time t, i.e., Q_i \ Ct = 0. There can appear new clauses in Ct, but by 
Observation 14. 81 all such clauses are derivable by weakening from Ct-i since Ot-i implies D and hence 
all of Df During such weakening moves the variable space increases monotonically and is bounded from 
above by VarSp (Ct). 

Erasure Since B t ^ D^t-ii it is immediate from Definition 14 . 5 1 that no new clauses can appear at time t. 
Any clauses in Q-i \ Cf can simply be erased, which decreases the variable space monotonically. 

Axiom download This is the only place in the case analysis where we need to do some work. Suppose 
that Bf = Bf_i U {D} for some axiom clause D G ^4[/d], where A in turn is an axiom of F. If C G Ct\Cf_i 
is a new projected clause, D must be involved in projecting it so there is some subset B C Bt— 1 such that 

B U {D} > C . (4.10) 

Also note that if Bt_i N C we are done since C can be derived from Q-i by weakening, so we can assume 
that 

Bt_i P C . (4.11) 

We want to show that all clauses C satisfying (14.101) and (I4.11t can be derived from Ct-i = proj ^(Bt-i) 
by downloading A G F, making inferences, and then possibly erasing A, and that this can be done without 
the variable space exceeding max{ VarSp (Cf_i), VarSp(Ct)}. The key to our proof is the next lemma. 

Lemma 4.9. Suppose that the clause set B over Vars(F[f,i\), the clause D G ^4[/d], and the clause C over 
Vars(F) are such that B U {D} [> C but B ¥ C. Then if we write the clause A as A = a\ V • • • V a^, 
it holds for every a« G A \ C that there is a clause subset W C B and a subclause C % C C such that 
W t> C l V Hi. That is, all clauses C V Hi for a>i G A\C can be derived from C = proj F (D) by weakening. 

Proof. Consider any truth value assignment a such that a(B) = 1 but a(\J x&c+ f d (x) V \/y & c-^fd(y)) = 
0. Such an assignment exists since B ¥ C by assumption. Also, since by assumption B U {D} > C we 
must have a(D) = 0. If A = a± V • • ■ V a^, we can write D G ^4[/d] on the form D = D\ V • • • V 
for Di G ai{fd] (compare with Equation (I4.6I) ). Fix any a G A and suppose for the moment that a = x 
is a positive literal. Then a(Di) = implies that a(f d {x)) = 0. By Observation 13.131 this means that 
a(-if d (x)) = 1. Since exactly the same argument holds if a = y is a negative literal, we conclude that 

or, rewriting (I4.12t using our overloaded notation, that 

DNCVSi . (4.13) 

If ai G C, the clause C V is trivially true and thus uninteresting, but otherwise we pick W C B and 
C ! C C minimal such that d4.13t still holds (and notice that since B ¥ C, the literal Hi cannot be dropped 
from the implication). Then by Definition [O] we have B* > C l V Hi as claimed. □ 
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We remark that Lemma l4.9l can be seen to imply that Vars(A) C Vars(Ct) = Vars(proj F (H>t)). For 
x G Vars(yl) n Vars(C) this is of course trivially true, but for x G Vars(A) \ Fors(C) Lemma l4~9l 
tells us that already at timet — 1, there is a clause in Q_i = proj^(ID) t _i) containing x, namely the 
clause C % V Oj found in the proof above. Since D t D ©t-i, this clause does not disappear at time t. This 
means that if we download A G F in our refutation 7r : F h under construction, we have Far5p(Ct_i U 
{.A}) < Var5p(Ct). Thus, we can download A G F, and then possibly erase this clause again at the end 
of our intermediate resolution derivation to get from Q_i to Q, without the variable space ever exceeding 
max{ Var5p(Ct_i), Var5 , p(Q)}. 

Let us now argue that all new clauses C G Q \ Q_i can be derived from Q_i U {A}. If A \ C = 0, 
then the weakening rule applied on A is enough. Suppose therefore that this is not the case and let A' = 
A\C = \/ a £Lit(A)\Lit(C) a - Appealing to Lemma l4~9l we know that for every a G A there is a C a Q C 
such that C a V a G Ct-i- Note that by assumption (14.111) this means that if x G Vars(A) n Vars(C), then 
x occurs with the same sign in A and C, since otherwise we would get the contradiction O 1= C V a = C. 
Summing up, C<_i contains C a V a for some C„CC for all a G Lit(A) \ Lit(C) and in addition we know 
that Lit (A) n {a | a G Lit(C)} = 0. Let us write A' = a\ V • • • V a m and do the following weakening 
derivation steps from Q_i U {A}: 

A^C\J A 1 
C ai V Si C V Oi 

C a2 Va 2 ^CV a 2 (4_ 14) 

Then resolve C V A' in turn with all clauses C V 01, C V 02, . . . , C am V a m , finally yielding the clause C. 

In this way all clauses C G Q \ Ct_j can be derived one by one, and we note that we never mention 
any variables outside of Vars (Cf_i U {^4}) C Vars(Ci) in these derivations. 

Wrapping up the Proof Of Lemma 14.61 We have proven that no matter what derivation step is made 
in the transition D<_i Dj, we can perform the corresponding transition Q_i ~-> Cj for our projected 
clause sets without the variable space going above max{ VarSp(Ct~i), VarSp(Ct)}. Also, the only time 
we need to download an axiom A G F in our projected refutation 7r of F is when ttj downloads some axiom 
D G -A[/d]. This completes the proof of Lemma l4.6l 

4.5 Lifting Variable Space Lower Bounds to Clause Space Lower Bounds 

Finally, the turn has come to the proof of Lemma 14/71 Recall the convention that x,y,z refer to variables 
in F while x\, . . . , Xd, yi, ■ ■ ■ ,yd,Zi,. . . ,Zd refer to variables in F[fd]. Also recall that we use overloaded 
notation D N C, D ¥ C, and D > C for C = C + V C~ (where C+ = V xeC x and c ~ = Vygc when 
the corresponding implications hold or do not hold for D with respect to \J xe c+fd(%) V \Jy & c-^fd{y)- 

We start with an intuitively plausible lemma saying that for all variables a; appearing in some clause 
projected by D, the clause set D itself must contain at least one of the variables x\, . . . , a^. 

Lemma 4.10. Suppose that D /s a ^ef of clauses derived from F[fd) and that C G proj F (D). Then for all 
variables x G Vars{C) it holds that {x±, . . . , Xd} D Vars(D) 7^ 0. 

Proof. Fix any D'Cl such that D precisely implies C in the sense of Definition l4.4l By this definition, for 
all z G Vars(C) we have W ¥■ C \ {z,z}. Suppose that z appears as a positive literal in C (the case of a 
negative literal is completely analogous). This means that there is an assignment a such that a(W) = 1 but 
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a {Vx£C+\{z}fd(x) v VyeC-^fd(y)) = 0- Since n ' > C, it must hold that a(f d (z)) = 1. Modify a into 
a 1 by changing the assignments to z\, . . . , z d in such a way that a'(f d (z)) = 0. Then a' (\f xeC +f d (x) V 
\l y& c-^fd{y)) = 0> so we must have a'(B') = 0. Since we only changed the assignments to (a subset of) 
the variables Z\, . . . , z d , the clause set D'CB must mention at least one of these variables. □ 

With Lemma 14.101 in hand, we are ready to prove Lemma |4~71 Note that everything said so far in 
Section |4] (in particular, all of the proofs) applies to any non-constant Boolean function. In the proof of 
Lemma l4~71 however, it will be essential that we are dealing with non-authoritarian functions, i.e., functions 
fd having the property that no single variable Xi can fix the the value of /^(xi, • • • , x d ). 

Suppose that B is a set of clauses derived from F[fd] and write V* = Vars(proj F (B)) to denote the set 
of all variables in Vars(F) appearing in any clause projected by B. We want to prove that Sp(B) = |B| > 
| V* | provided that fd is non-authoritarian. 

To this end, consider the bipartite graph with the clauses in B labelling the vertices on the left-hand side 
and variables in V* labelling the vertices on the right-hand side. We draw an edge between D G B and 
x £ V* if Vars(D) D {x%, . . . ,x d } ^ 0. By Lemma l4~T0l it holds that Vars(JD>) D {x\, . . . ,x d } ^ for all 
variables x G K*, so in particular every variable x G V* is the neighbour of at least one clause fl£D. Let 
us write N(D) to denote the neighbours of a left-hand vertex D and extend this notation to sets of vertices 
by taking unions. 

We claim that if V* = Vars(proj F (B)) / 0, then there must exist some clause set D'CI satisfying 
|B'| > N(W). Suppose on the contrary that |B'| < N(W) for all W C D. Then by Hall's marriage theorem 
there is a matching of the clauses in D into the variable set V*. Assume that C = C + V C~ is any clause 
projected by D (such a clause exists since V* ^ 0). Then surely 

E> N V xe c + fd@) V \Jy EC -^fd(y) (4-15) 

(there is even a subset of D such that this implication is precise). But using the matching between D and V*, 
we can satisfy O without assigning values to more than one variable Xi G Vars(D) corresponding to any 
x G Vars(F). Since fd is non-authoritarian, we can then extend this assignment to another assignment 
falsifying fd(x) for all x G C + and satisfying fd(y) for all y G C~. This means that our assignment 
satisfies the left-hand side of the implication (14.151) but falsifies the right-hand side, which is a contradiction. 
The claim follows. 

Hence, fix any largest subset Di C D such that |Di| > N(Bx). Clearly, if Di = D we are done 
(remember that N(B>) = V*), so suppose Di ^ D. In much the same way as above, we show that this 
assumption leads to a contradiction. 

Let D 2 = D \ Di f and define the vertex sets V* = iV(Di) and V 2 * = V* \ V{. Note that we 
must have V 2 * C N(B 2 ) since N(B) = N(Bi) U N(B 2 ) = V*. By the maximally of Di it must hold 
for all W C B 2 that |B'| < \N(W) \ V{\, because otherwise D" = »i U D' would be a larger set with 
|B"| > |iV(B")|. But this implies that, again by Hall's marriage theorem, there is a matching M of B 2 into 
N(B 2 ) \ V{ = V 2 *. Consider any clause C G proj F {B) such that Vars(C) n V 2 * ^ and let D'CDbe 
any clause set such that 

®'> V x ec+fd@)vV V ec-^fd(v) (4-16) 
(the existence of which is guaranteed by Definition I4.5I ). We claim that we can construct an assignment a 
that makes B' true but VxeC+/d(^) V Myec-^fdiv) false. This is clearly a contradiction, so if we can 
prove this claim it follows that our assumption Bi ^ B is false and that it instead must hold that Bi = B 
and thus |iV(B)| = \V*\ < |B|, which proves the theorem. 

To establish the claim, let B£ = B' n D» for i = 1, 2 and let C< = Cf V Cr for 

Cf=\jx and Cr= \J y (4.17) 

xeC y&C 

xav* y£ v* 
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and i = 1,2. We construct the assignment a satisfying W but falsifying \J ' x&c + j 'd{x) V My^c-^fdiv) m 
three steps: 

1. Since V C t ~ = Ci ^ C by construction (recall that we chose our clause C in such a way that 
Vars{C) n V 2 * 7^ tne minimality condition in Definition 14.51 yields that 

D i * VxeC+fdW V \l wc ^fd{y) (4-18) 

and hence we can find a truth value assignment oc\ that sets D' x to true, all fd{x±, ■ ■ ■ ,%d), x G 
Ci , to false, and all f^(yi, ■ ■ ■ , yd), V £ , to true. Note that ol\ need only assign values to 
{zi, ...,Zd\ze Vars(Ci)}. 

2. For D' 2 , we use the matching M into y 2 * found above to pick a distinct variable x(D) G Vars(F) for 
every D G D 2 an d then a variable x(D)i G Vars(F[/rf]) appearing in D, the existence of which is 
guaranteed by the edge between D and x{D). Let 0:2 be the assignment that sets all these variables 
x(D)i to the values that fix all D G W 2 to true. We stress that CK2 assigns a value to at most one 
variable x{D)i for every x(D) G Vars^). 

3. But since fd is non-authoritarian, this means that we can extend 02 to an assignment to all vari- 
ables x(D)i, . . . , x(D)d that still satisfies D' 2 but sets all fd(xi, • • • , Xd), x G C 2 , to false and all 

fd(yi>--->yd), y e c^, to true. 

Hence, a = «i U «2 is an assignment such that a(D') = 1 but a(\J x( - c+ f d (x) V \/y & c-^fd{y)) = 0> 
which proves the claim. This concludes the proof of Lemma l4~71 

Since Lemmas 14.3114.61 and !4.7l have now all been established, the proof of Theorem l2.1l is complete. 

5 Substitution Space Theorem for A -DNF Resolution 

To extend the substitution space theorem to /c-DNF resolution, we construct a proof along very much the 
same lines as the proof for standard resolution in Section 0] The details of the proof gets much trickier, 
however, since the ideas used to obtain the (essentially tight) analysis for Theorem 12.11 no longer work. 
Indeed, they provably break down in view of the recent results in [NR09|. Instead, new, somewhat cruder, 
tools have to be developed. 

Before proving the theorem, let us state it in its full generality with explicit constants. To parse the 
theorem more easily, the reader might be helped by thinking of c, d, and k as constants. (This is the way the 
result was stated in Theorem 12.21 and this is also the case that will be of interest in our applications of the 
theorem). 

Theorem 5.1. Let F be any unsatisfiable c-CNF formula and fd be any non-constant Boolean function of 
arity d. Then the following two properties hold for the substitution formula F[fd]: 

1. If F can be refuted in syntactic standard resolution in length L and total space s simultaneously, then 
F[fd] can be refuted in syntactic 9\(d) in length L-d 4 ° -4 cd and total space s-d2 d +(cd+2) 3 -4 cd +0(l) 
simultaneously. 

2. If fd is k-non-authoritarian and F[fd] can be refuted by a semantic 9\(k)- refutation that requires for- 
mula space s and makes L axiom downloads, then F can be refuted by a syntactic standard resolution 
refutation that requires variable space at most (2sk) k+1 • 4 fc d and makes at most L axiom downloads. 
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The proof of the first part of Theorem 15.11 is straightforward, if a bit technically involved, and closely 
follows the corresponding proof for standard resolution. The second part of the proof is also very similar 
up to the point when we need to relate the formula space in 9\(k) -refutations of the substituted formula to 
the variable space of resolution refutations of the original formula. This boils down to bounding how many 
different Boolean functions of a particular form can be implied by a small set of /c-DNF formulas. 

A related, more narrow, problem, although strictly speaking not a special case of this general problem, 
is how many variables can appear in a minimally unsatisfiable set of /c-DNF formulas of a given size (Defi- 
nition l2~T2l on page [121. Although there is no formal connection between the two problems as far as we are 
aware, the proof for standard resolution in Section @]relies heavily on the techniques to prove tight bounds 
for minimally unsatisfiable CNF formulas. In this section, we find that this is the case for /c-DNF resolution 
as well, as our techniques for proving bounds on minimally unsatisfiable /c-DNF sets carry over to the proof 
of the %K{k) substitution space theorem. And interestingly enough, the proof in [NR09| that the analysis of 
our techniques for dK{k) is almost tight, and hence that any further substantial improvements seem to require 
fundamentally different ideas, are also derived from bounds on minimally unsatisfiable /c-DNF sets. 

Hence, minimal unsatisfiability for /c-DNF sets appears to be an interesting and useful concept. Since 
in addition the underlying combinatorial problem is natural and appealing, we do not focus exclusively on 
/c-DNF resolution in this section but also discuss our results for this purely combinatorial problem in some 
detail below. 

5.1 Proof of JH(fc) Substitution Space Theorem — Main Components 

Following the pattern set in Section @l let us start by presenting the main components of the proof and how 
they fit together to yield the theorem. As the reader will see, on a high level the proof is very similar to that 
of Theorem l2.ll but there are also some crucial differences in the details that we comment on below. 
The first part of Theorem I5. II which is Lemma l5~2l below. is established in Section l5~2l 

Lemma 5.2. Let F be any c-CNF formula and fd : {0, l} d i— > {0, 1} be any non-constant Boolean func- 
tion. Then it holds that if F can be refuted in standard syntactic resolution in length L and total space s 
simultaneously, the substitution formula F[fd] can be refuted in syntactic SH(d) in length L ■ ■ 4 cd and 
total space s ■ d2 d + (cd + 2) 3 • 4 cd + 0(1) simultaneously. 

To prove the second part of the theorem, we need to show how to convert a 9^(A;)-refutation 7T/ of F[fd] 
into a resolution refutation it of F such that the variable space of it is bounded by the space of 7rj, raised 
to the power of k + 1. This part of the proof splits into two components. In Lemma l5~4l we claim that 
each /c-DNF set D G 7rj can be "projected" onto a set of clauses over Vars(F), such that the sequence of 
projected clause sets forms the "backbone" of a resolution refutation of F. This is just as in Theorem 12. II 
except that here we lose a constant factor in the space bound when completing the backbone to a full proof. 
Then, in Lemma l5~5l we claim that if C is a set of clauses projected by a /c-DNF set D, the variable space of 
C is at most « |D| fe+1 . 

The reason that we lose a constant factor when filling in the backbone is that the definition of projected 
clauses for 9t(k), though still phrased in terms of precise implications (Definition l4.4l on pagel25l. is slightly 
different from (the stronger) Definition 14.51 This in turn depends on the fact that we cannot get this latter 
definition to work in the proof of Lemma l5~5l below. 

Definition 5.3 (Projections by £R(/c)-derivations). Let F be a CNF formula and fd a non-constant Boolean 
function, and suppose that D is a /c-DNF set over Vars(F[fd]). Writing C = C + V C~ as a disjunction of 
the positive and negative literals, respectively, we say that D projects the clause C if D > Vxgc+ fdffi) V 
Vyec- ^fdiv) holds. We let 

prvj* F (®) = {C\B> \J x€C+ f d (x) V y vec -^f d (y)} (5-1) 
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denote the set of all clauses that D projects on F. 

The difference from Definition l4.5l is that here, projected clauses are defined in terms of implications by 
the whole formula set D and not by arbitrary subsets of D. Definition 14. 51 is stronger than Definition in 
that it gives us more projected clauses, which can be used to prove sharper lower bounds, but as mentioned 
above it turns out to be a bit too strong for our techniques to work in the fc-DNF resolution case. 

Informally, the next lemma states that the projection of a A:-DNF resolution refutation of F[f d ] is es- 
sentially a refutation of the original formula F in standard resolution. The proof of this lemma appears in 
Sectionl5~3l 

Lemma 5.4. Let k > 1. Suppose that ttj = {Do> • • • > I^V} is ci 9\(k) -refutation of F[f d ]for some arbitrary 
unsatisfiable CNF formula F and some arbitrary non-constant function f d . Then the sets of projected 
clauses {proj F (B>o), . . . , pro jp(JS> T )} form the "backbone" of a resolution refutation tt of F in the sense 
that: 

• proj* F (Po) = 0. 

• proj* F (B T ) = {0}. 

• All transitions from proj F {Dt—i) to proj* F (D t ) for t £ [r] can be accomplished by axiom downloads 
from F, resolution inferences, erasures, and possibly resolution weakening steps in such a way that the 
variable space in tt during these intermediate steps never exceeds 2 • maxp g7r/ { VarSp(proj F (P)) ) . 

• The only time tt performs a download of some axiom C in F is when it f downloads some axiom 
D £ C[f d ] in F[f d }. 

The following statement is the main tecnical novelty in the extension of the substitution space theorem 
to /c-DNF resolution. It is proven in Section l5~4l 

Lemma 5.5. Suppose that f d is a k-non-authoritarian function of arity d > k and that D is a k-DNF set 
over Vars(F[f d ]) for some CNF formula F. Then it holds that VarSp(proj* F {B)) < 4 k2d ■ (k ■ Sp{B)) k+1 . 

Putting together Lemmas 15.21 15.41 and 15.51 the D\(k) substitution space theorem follows exactly as in 
the proof of the corresponding theorem for resolution in Section l4~2l We leave it to the reader to fill in the 
details and instead turn to establishing the lemmas. 



5.2 Converting Resolution Refutations of F to 9i(fc)-refutations of F\f] 

To prove Lemma IB~2l we convert a resolution refutation tt of F into a 9\(d) -refutation of the substituted 
formula F[f d ] while (roughly) preserving the length and total space simultaneously. This is done in two 
steps. First, we substitute each positive literal x appearing in a clause C in tt with some d-DNF representing 
f d (x) and similarly substitute -<x with a <i-DNF representing -<f d (x). The sequence of sets of clauses that 
was tt is transformed under this substitution into a sequence of d-DNF sets that forms the "backbone" of 
a *R(d) -refutation. Then, we convert the backbone into a proper *R(d) -refutation by simulating resolution 
inferences and axiom downloads. Consider a resolution inference step in tt which involved inferring C V C 
from C V x, C V -\x. After substitution what we need to show is that C[f d ] V C'[f d ] can be inferred from 
C[f d ] V x[f d ], C'lfd] V -ix{f d ] in Dt(d). This is shown in Lemma l5~6l below. The simulation of an axiom 
download is similarly addressed in Lemma l5~71 where we show that we can derive any d-DNF representation 
of A[fd] for an axiom A G F via a 9\ (k) -derivation of bounded length and space. Given these two lemmas, 
the proofs of which follow below, we can complete the proof of Lemma l5~2l 
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Lemma 5.6. Suppose D\ , D2 are two k-DNF formulas over r variables. IfD\ A D2 N 0, then the k-DNF set 
{Di, D2} has a 9\(k) -refutation of length \D\ \ ■ \D%\ and total space at most 2( TotSp(D\) + TotSp(D2)) 
simultaneously. 

Lemma 5.7. Suppose F is a CNF formula and D is a k-DNF formula and \ Vars(F) U Vars(D)\ = r. If 
F N D then D can be derived from F via a 9\(k) -derivation of length less than k^ ■ 2 r+1 and total space 
at most ((r + 2) TotSp(D)) 2 simultaneously. 

Postponing the proofs for a moment, let us see how these two lemmas yield Lemma l5~2l 

Proof of Lemma |5~21 Let ir = {Co, • • • , C r } be a resolution refutation of the CNF formula F. Let irj = 
{Do> • • • , Or} denote the sequence of ci-DNF sets obtained by substituting it with f d in the following way. 
We start by fixing for each literal a a ci-DNF formula representing a[f d ]. For a clause C = \J i a,; appearing 
in Ct, construct a ci-DNF formula Dc which represents C[f d ] by taking the disjunction of the ci-DNF 
formulas representing aj. Finally, set Df = {Dc \ C G Ct}. In this way, every clause in Ct turns into a 
ci-DNF formula in D t - Notice that the total space of 3 t is less than d ■ 2 d times the total space of Ct because 
every literal appearing in Ct turns under substitution into a ci-DNF with less than 2 d terms. To complete the 
proof of Lemma l5.2l it suffices to show for < t < r that Df+i can be derived from D t via a SH(d) -derivation 
of length < d AC ■ 4 cd and extra total space (cd + 2) 3 • 4 cd + 0(1). We divide into cases according to the 
type of the derivation step at time t. 

Erasure If Q+i = Cf \ {C} then by construction we have Df+i C Dt» so I^t+i can be derived in D\(d) 
from D t by erasures. 

Axiom download Let A G F be the axiom downloaded at time t + 1, i.e., Ct+i = Ct U {A}. Let A' be 
an arbitrary <i-DNF representation of recalling that A[fd] is a set of axioms of i* 1 [/</]. This set 

involves at most c • d many variables and A[fj\ 1= A'. Furthermore, A' is a DNF formula over 2cd 
many literals so it has at most 4 c d many terms and has total space at most cd4 cd . Applying Lemma l5^7l 
we conclude A' can be derived from A[fd] in length d AC ■ 2 cd+1 and variable space (cd + 2) 3 • 4 cd . 

Inference Suppose Q+i = Ct U {C V C'} where C V C is derived from C V x, C V -<x G Ct. Notice that 
(C V s) [f d ] = (C[f d \) Vx[f d ] and (C V -x) = C"[/ d ] V Since we can bound the number 

of terms in a ci-DNF formula representing x[fd] by 2 d , by Lemma 15^61 we can derive the empty DNF 
formula from d-DNF formulas representing x[f d ] and ^x[f d ] via a derivation of length at most 2 2d 
and total space at most 2 2d+1 . Applying weakening steps, when necessary, to the formulas involved 
in this refutation, we conclude that the ci-DNF formula representing (C V C) [f d ] can be derived from 
the ci-DNF formulas representing (C V x) [f d ] and (C V -<x) [f d ] via a derivation of length at most 
2 2d and 2 2d+1 extra total space. 

Weakening Suppose C m = Cf U {C V C'} for C G C t . Then the ci-DNF formula representing (C V C) [f d ] 
can be derived in a single step from the ci-DNF formula representing C[f d ] using weakening. 

We have shown how to complete the conversion of ttj into a *K(ci) -refutation of F[f d ] that is longer by at 
most a factor of d 4Cd ■ 2 cd and uses at most (cci + 2) 3 • 4 cd + O(l) extra total space. Taking into account the 
upper bound of S ■ d ■ 2 d on the total space of D t , the lemma now follows. □ 

It remains to prove Lemmas 15.61 and 15^71 We attend to them in order. 

Proof of Lemma 13731 First we claim that for every term T G D\ and for every term X" G D2 we have 

T' n {^a I a G T} ^ . (5.2) 
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To see this, assume by way of contradiction that ( I5.2t fails to hold for T G D\ and T' € D2. Consider the 
minimal restriction p that satisfies T. We see that p satisfies D\ and can be extended to an assignment that 
satisfies T" as well, contradicting the assumption D\ A D 2 N 0. 

The refutation of {D\, D 2 } proceeds by sequentially removing from D\ all its terms. Let T be a term 
of D\ that we wish to remove. By (I5.2t each term T' G L>2 contains a literal -ia such that a £ T. Apply 
A-elimination to replace T 1 by -ia. Repeating this process for each term T' G D2 we derive from D 2 in 
extra total space at most TotSp(D 2 ) the clause VaeT ~' a - R es °l ve this clause with D\ to remove T. This 
step requires extra total space at most TotSp {D\ ) + TotSp (D 2 ). Repeat the process for all T G D\ to obtain 
the empty DNF. This process required total space at most 2{TotSp{D\) + TotSp(D 2 )) and the refutation 
length is |-Di| • I-D2I so the lemma follows. □ 

Before giving the formal proof of Lemma 15.71 we explain the main ideas by working out a concrete 
example. Suppose F is the CNF that expresses the exclusive-or function over k variables and let us derive, 
in £H(fc), the DNF formula D that expresses the same exclusive-or over k variables. Recall that F has 2 fc_1 
clauses, each of size k, and similarly D has 2 fc ~ 1 terms of size k each. Our starting point is the observation 
that whenever a CNF F implies a DNF D, it holds that if we pick any one literal from each term of D and 
call this set of literals C, viewing it as a clause, then F t= C. Indeed, if this were not the case we could find 
an assignment that satisfied F but falsified C. But if C is falsified this means we have set one literal in each 
term of D to false, hence D is set to false, contradicting F t= D. 

Now, to derive D from F we proceed as follows. Let us write D for the sake of notational convenience 

as 

D = (ai A a 2 A . . . A a k ) V (b\ A b 2 A . . . A b k ) V . . . V (z\ A z 2 A . . . , Az k ) , 

where ai, . . . , a k ,bi, . . . ,b k , . . . , z k are names of literals over the k variables of our exclusive or. (Notice 
that many of these names refer to the same literal.) By the discussion in the previous paragraph we know 
that 

F\= (01 V61 V ... V21) . 

By the implicational completeness of resolution we can derive this clause from F in resolution. Similarly, 
we can derive each of the clauses 

(o 2 V bx V . . . V Zi), . . . , (ojfc V bi V . . . V zi) 

with the only difference being in the value of the very first literal in the clause. So using k — 1 applications 
of the A -introduction rule we can obtain 

((ai A a 2 A ... A aj.) V 61 V ... V 21) . 

Using the same strategy we can obtain the k — 1 formulas 

((ai A 02 A ... A ak) V 62 V ... V Zi), ... , ((oj A a 2 A . . . A a fc ) V b k V . . . V z\) 

and once we have them, another sequence of k — 1 applications of A-introduction leads to 

((ai A a 2 A . . . A afc) V (61 A 6 2 A . . . A b k ) V ci V . . . V 21) . 

Continuing in this fashion we keep increasing the number of size-/c terms in our formula until we obtain all 
of D. 

Note that using this construction, we get a length bound that is doubly exponential in the number of 
variables in downloaded axiom clauses. This happens when the number of terms in the DNF formula D is 
exponential in the number of variables r, in which case the resulting length bound is of the form fc 2n(r) . This 
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is so since our proof strategy outlined above derives each clause that is obtained by selecting one literal per 
term of D. If, as in this example, D is the /c-DNF that computes the exclusive or, the number of terms is 
2 k ~ 1 and each term has size k, thus, we derive k 2 clauses in total. We note that this seems very inefficient 
given the fact that there are only k underlying variables. Since we are not primarily interesting in optimizing 
what in the end will be constant factors, we leave as an open problem the task of reducing the length bound 
stated in Lemma l5~71 to a singly-exponential one and instead proceed with the formal proof of the lemma. 



Proof of Lemma Uyi As explained above, we derive from F in resolution a set of clauses that is equivalent 
to the A;-DNF formula D. From this set of clauses we derive D using a sequence of A -introduction inference 
rule applications. The key idea is to do all of this in a space-efficient manner by deriving the clauses one 
by one in a particular order and "merging" each derived clause into a DNF formula that, at the end of this 
process, turns out to be D. Details follow. 

Denote \D\ by s. Suppose D = Vi=i Aj=i a hi wnere h < k and ciij denotes a literal (belonging to a 
set of r variables). By the distributivity of disjunction over conjunction, D is equivalent to the CNF formula 

s 

G D := f\ \/ a itji . (5.3) 

ji,...J a e[ki\x...x[k a ] i=l 

Each clause of Gd is implied by F because otherwise there would be an assignment satisfying F but 
falsifying Gd, thereby falsifying D as well, in contradiction to the assumption F 1= D. By the implicational 
completeness of resolution (Proposition I3.8t there is a resolution derivation of each clause of Gd from F. 
This derivation has length less than 2 r+1 and space at most (r + 2) 2 because it involves at most r variables. 
We now show how to construct D from the clauses of Gd. 

For s' e [s] and j = {j s > +1 , . . . ,j s ) € [Av+i] x . . . x [k s ], let 

(s 1 ki \ s 

V A a M V V • (5-4) 

j=lj=l J i=s'+l 

We prove by induction on s' > that D.-i can be derived in total space 

(r + 2)(TotSp(D s ,j))y = ^(r + 2) (j^ kl + (s - s')^ (5.5) 

and length less than k s 2 r+1 . The base case (s' = 0) follows from the discussion in the previous paragraph 
because D Q j is a single clause that is implied by F. For the inductive step assume the claim holds for s' — 1. 
We show how to derive, for k' = 1, . . . , k s i, the formula 

A'-i h \ I k' \ s 
D' k > : V A "■■< A "A, V \/ a idi (5.6) 

\i=lj=l / \j=l / i=s'+l 
in length less than k'k s '~ 1 2 r+1 and total space 

{(r + 2)(TotSp(D' k ,))) 2 = ((r + 2)(j2ki + k' + (s- S ')]] . (5.7) 
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This is shown by induction on k' > 1. For k' = 1 notice (I5.6t is nothing but £} S '-l,(l,;7 s / so by tne 

inductive hypothesis with respect to s' — 1 it can be derived in length less than k s ~ 1 2 r+1 and total space 



fa 1 -X 



(r + 2) [Y,^ + {s-(s'-l))\ = (r + 2) J> + fc' + (s - S ') (5.8) 



.i=l / / \ \i=l 



For the inductive step assume we have derived D' k , using at most the total space stated in d5.7t . Erase all 
formulas in the memory but for D' k , and notice this remaining formula has total space 



s'-l 



J2 k i + k' + (s-s') . (5.9) 

i=l 

Using the inductive hypothesis on s' — 1 again, derive the DNF formula 

s'—l ki \ s 

\J f\ (Hj V a s ' jk '+x V \/ a iijt (5.10) 

i=l j=l / i=s'+l 

in total space as in (I5.8t and length less than fc s _1 2 r+1 . Notice that the total total space used is bounded 
by the sum given in i5.Sl plus the sum in (I5.9t (this latter space is required to save the formula D' k ,) so the 
combined total space is at most 

/ /s'-l \ \ 2 s'-l 



(r + 2) £ A* + («-(«' -1)) +J>. + + 



,i=l / / i=l 

/ /s'-l \ \ 2 



, 8=1 



Now combine D^,, and (I5.10t using a single /\-introduction step to obtain Z?fc/+l- We see that Dfc' + i can be 
derived in total space bounded by ( I5.8t and length less than k'k s ^ 1 2 r+1 . Summing over J\i — 1 ..... k we 
conclude that the derivation of D gl+l j is of length less than k • k s ~ 1 2 r+1 and total space 

^(r + 2)^^ + («-(a' + l))^ (5.12) 

as claimed. Setting s' = s and noticing TotSp(D) = Ylt=x ^ finishes the proof of the lemma. □ 

The following easy corollary of the proofs above will also be useful to us. 

Corollary 5.8. Let F be any c-CNF formula and fd ■ {0, l} d i— > {0, 1} be any non-constant Boolean func- 
tion. Then it holds that if F can be refuted in standard syntactic resolution in length L and formula space s 
simultaneously, the substitution formula F[fd] can be refuted in syntactic SR(cQ in length L ■ d ACd ■ 4 cd and 
formula space s + 2 d + 0(1) simultaneously. 

Proof. This follows from inspection of the proofs of Lemmas 15.61 and 15.71 noting that every clause in the 
standard resolution proof can be represented by a single corresponding ci-DNF formula in the lH((i)-proof. 
We omit the details. □ 
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5.3 Projected ^(/^-refutations Are (Essentially) Resolution Refutations 

We next prove the converse of Lemma I5T21 namely that !tK(/c)-refutations of substituted formulas can be 
translated back to standard resolution refutations of the original formulas as claimed in Lemma l5"4*l 

This part of the substitution space theorem for fc-DNF resolution is the one that is the most similar to 
the theorem for standard resolution, and the proof of Lemma f5~4l closely follows the structure of that of the 
corresponding Lemma l4~6l We note, though, that there are a few subtle differences between the two proofs 
due to the fact that our definition of precise implication (Definition l5.3t is somewhat different than the one 
used in the substitution space theorem for resolution (Definition l4.5l) . Definition 14-51 appears to be "the right 
definition" and yields tighter results for standard resolution, but for technical reasons we are forced to relax 
it a bit in order to obtain the results for fc-DNF resolution. 

We first fix some notation. In analogy with Section 14.41 we use the convention that D and D denote 
fc-DNF sets and fc-DNF formulas over Vars{F[f<i\) while C and C denote clause sets and clauses over 
Vars(F). Let us also overload the notation and write D 1= C, K C, and D > C for C = C + V C~ when 
the corresponding implications hold or do not hold for D with respect to Vxgc+/d(^) V \ly^c-^fd{y)- 
Finally, let Q be a shorthand for proj* F (Bi t ). 

Suppose now that 7rj = {H>o, ■ • • > I^V} is a /c-DNF resolution refutation of F[fd] for some arbitrary un- 
satisfiable CNF formula F and some arbitrary non-constant function /<j. The first two bullets in Lemma l5~4l 
are immediate. The hard part is to show that all transitions from Q_i = proj* F {Pt-i) to C t = proj F (B>t) 
can be carried out in such a way that the variable space never exceeds VarSp (Cj_i) + VarSp(Ct) < 
2 • max s€ [ T ] { VarSp(C s )} during the intermediate derivation steps needed. The proof is by a case analysis 
of the derivation steps. In the case analysis we will again need Observation 14. 81 which is easily seen to hold 
in this context as well with exactly the same proof. 

Consider now the rule applied in nf at time t to get from B>t-i to D*. We analyze the three possible 
cases — inference, erasure and axiom download — in this order. 

Inference Note that B>t-i N since all inference rules are sound. Moreover, since O t 5 D>t-i we have 
B>t t= Ut-i. It follows from Definition 15 . 3 1 that the set of projected clauses does not change, i.e., Q_i = Q, 
and nothing needs to be done. 

Erasure If C G Q \ Ct_i is a new projected clause appearing at time t as a result of an erasure D t = 
Ot_i \ {D}, it clearly holds that B>t-i ^ C. Hence, all such clauses C G Q \ Q_i can be derived by 
weakening from Q_i by Observation 14. 81 after which all clauses in Q_i \ Ct can be erased. During these 
intermediate steps the variable space is upper-bounded by VarSp(Ct-iUCt) < VarSp(Ct-i)+ VarSp(Ct). 

Axiom download This is the place in the case analysis where we need to do some serious work. Suppose 
that D t = Dj_i U {D} for some axiom clause D S -A[/(f], where A in turn is an axiom of F.lfC<E Ct 
is a new projected clause then we must have H>t-i ¥ C and B f _i U {D} > C. 

We want to show that all such clauses C can be derived from Q_i = proj* F (B>t-i) by downloading 
Aei 7 , making inferences, and then possibly erasing A, and that this can be done without the variable space 
exceeding VarSp(Ct-i) + VarSp(Ct). Our tool for proving this is the next technical lemma, which is the 
analogue of Lemma l4~9l 

Lemma 5.9. Let D be a k-DNF set derived from D G F[fd], D G A[fd] be an axiom clause ofF[fd], and C 
be a clause over Vars(F). If ID, D, and C are such that D U {D} [> C but D ¥■ C. Then if A = a\ V- ■ -Vaj., 
for every a, G A \ C there is a subclause C l C C such that D > C % V Sj. That is, all clauses C Va 8 for 
a,i G A\C can be derived from C = proj* F (D) by weakening. 
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Proof. Consider any assignment a such that a(D) = 1 but a(V x£C+ f d (x) V \ly £ c-^fd{v)) = 0- Such 
an assignment exists since D ¥■ C by assumption. Also, since by assumption D U {D} t> C we must have 
a(.D) = 0. If ,4 = oiV- • -Vo s , we can write!) G -A[/ d ] on the form!? = D x V- • • VD S for Di G aj[/ d ]. Fix 
any a E i and suppose a = x is a positive literal. Then a(Di) = implies that a(f d (x)) = which means 
that = 1. Since exactly the same argument holds if a = y is a negative literal, we conclude 

(using our overloaded notation) that 

DhCVSj . (5.13) 

If di G C, the clause C V a, is trivially true, but otherwise we pick C % C C minimal such that (I5.13t still 
holds (where since D C, the literal a, cannot be dropped from the implication). Then by Definition 15.31 
we have D > C l V aj as claimed. □ 

Lemma l5~9l tells us that every x G Fars(A) \ Vars(C) appears in some clause at time t — 1, namely, in 
the clause C l V Hi found in the proof above. Since in addition obviously Vars(A) n Vars(C) C Vars(Ct) 
this means that if we download A G F in our refutation 7r : F h under construction, we have Vars (A) C 
Vars(C t -i) U VarsQCt) and hence VarSp{C t -i U {A}) < Var5p(C t _i) + Var5p(C t ). 

Thus, we can download A G F, and then possibly erase this clause again at the end of our intermediate 
resolution derivation to get from Q-i to Q, without the variable space ever exceeding VarSp{Ct-i) + 
VarSp(Ct). The concluding argument that all new clauses C £ Q\ Ct_i can be derived from Ct-i U {vl} 
is exactly as in Section 14.41 but using Lemma 15.91 instead of Lemma 14.91 and it is easily verified that we 
never mention any variables outside of Vars (Q_i) U Vars(A) U Vars (C) in these derivations. 

Concluding the proof of Lemma I5*!4*l we have established that no matter what derivation step is made in 
the transition Dt_i *+* Ot, we can perform the corresponding transition Q-i Q for our projected clause 
sets without the variable space going above VarSp(Ct-i)+VarSp(Ct) < 2-maxo e7r/ . { VarSp(proj* F (jS)))} . 
Also, the only time we need to download an axiom A G F in our projected refutation 7r of F is when Try- 
downloads some axiom D G This completes the proof of Lemma 15 .41 

5.4 On the Size of Minimally Unsatisfiable and Minimally Implicating h -DNF Sets 

We now prove the third and final lemma in Section l5~Tl namely Lemma l5~5l bounding the number of vari- 
ables appearing in a /c-DNF set that minimally implies a formula. We first deal with the "special case" of 
minimally unsatisfiable sets (Theorem 12. 141 on page[T3l. The actual result in Lemma 1531 needed to prove 
the substitution space theorem follows the outline of this simpler case. 

The following simple but important lemma will be used both in the proof of Theorem 12.141 and of 
Lemma l53l 

Lemma 5.10. Suppose that D is a k-DNF set that minimally implies a formula G. Then for every literal a 
appearing in any term T in a k-DNF formula D G there exists a restriction p to Vars(D) satisfying 

• \p\ < fc|B|. 

• D'\ p = Ifor all D'eD \ {D}- 
. (T\{a})\ p = l. 

• G\ p + 1. 

The point here is that, intuitively speaking, the restriction p is very nearly satisfying the /c-DNF set D 
(except for a single literal in a single term) but still has not fixed the formula G implied by D to true. Also, 
p assigns values to comparatively few variables. 
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Proof of Lemma \nU\ By Definition 12. 121 there exists an assignment a to Vars(0) such that a(D') = 1 
for all A G B' \ {D} and a(T \ {a}) = 1 but a(G) = 0. Let p be a restriction of minimal size that 
agrees with a and satisfies the second and third bullet in the statement of the lemma. Such a restriction can 
be found by selecting one term T' satisfied by a in each D' G W \ D and setting p to agree with a on 
(J j Vars(Tj) U Vars(T \ {a}) and be unfixed elsewhere. Since \T'\ < k we see p has size < fc|B'|. The 
last bullet stated above holds because G{a) = and p agrees with a on all variables fixed by p. □ 

Let us now restate and then prove Theorem l2. 141 

Theorem 12.141 (restated). Suppose that D is a minimally unsatisfiable k-DNF set. Then the number of 
variables in D is at most | Vars(D)| < (k • |D|) fc+1 . 

Proof. Let D = {D x , D m } be a fc-DNF formula set with m = |B|. For S a set of literals, let D^S) be 
the set of terms in A that contain S (recall that we identify a term with the set of literals appearing in it). 
Formally, 

D t (S) = {TeDi,:T^S} . (5.14) 

Let Vars(Di(S)) denote the set of variables appearing in the set of terms Di(S). Our theorem follows from 
the next claim. 

Claim 5.11. If Sis a set of literals and \S\ = k — r then | Vars(Di(S))\ < * ■ (km) r . 

Before proving the claim, let us complete the proof of the theorem. Take S = for which we get r = k 
and notice that Di{%) = Di. Claim l5~TT1 gives 

| Vora(A)| = I Vars(D0j)\ < k{km) k (5.15) 

and summing over all all m formulas in the set we get 

| Vars(B)\ < YT=i\ Vars{Di)\ < m ■ k{km) k = (km) k+1 = {k\B\) k+1 (5.16) 

which concludes the proof. □ 

Proof of Claim IT771 By induction on r > 0. For the base case of r = notice \S\ = k so there can be at 
most one term in Di that contains all literals in S implying | Vars(Di(S)) \ is either or k. 

For the inductive step we may assume the existence of some term T G Di that strictly contains S, 
because otherwise S appears at most once as a term in Di and the claim holds as in the base case. Assuming 
T 2 S, let a be a literal in T \ S. Lemma B. lOl suarantees the existence of a restriction p of size at most km 
such that Dj\ = 1 for all j G [m],j ^ i and (T \ {a})f p 7^ 0. By the unsatisfiability of D we conclude p 
falsifies every term T G A for which T' D S. Since (T \ {a}) \ = 1 and (T \ {a}) 5 S we conclude that 
every term in A that contains S must also contain a literal set to false by p, because otherwise p could be 
extended to an assignment satisfying D. Recall that -<p is the set of literals set to false by p. We have just 
shown that 

A(S) = U^ p A(5U{a'}) . (5.17) 

So to bound Vars(Di(S)) we need only bound Vars(Di(S U {a'})) for all a' G -<p. We use the inductive 
hypothesis. Notice (-ip) n 5 = because p satisfies S. Thus, for a' G ->p we have \ S U {a'}\ = k— (r — 1). 
Apply the inductive hypothesis to S U {a'} to conclude 

I Vara(A(S))| < Ea'e^pl Vbr«(A(5 U {a'}))\ < k(km) r - 1 . (5.18) 
Summing over all a G -<p and recalling that |->p| = \p\ < km establishes the claim. □ 
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To prove Lemma 1531 we need to address two issues that did not appear in the previous proof. First, 
our starting point is a /c-DNF set B that is satisfiable and implies a set of projected clauses. We deal with 
this by constructing a formula (to be denoted G') that is the conjunction of all clauses projected by B. The 
second issue, which is more subtle, is that B is a set of formulas defined over Vars(F[fd\) whereas the 
clauses projected by D are over the different variable set Vars(F). The following definition provides some 
convenient notation for connecting the two sets of variables. 

Definition 5.12 (Shadow). For a a literal over a variable y G Vars(F[fd\) let the shadow of a, denoted 
V(a), be the variable x G Vars(F) to which a belongs, i.e., the shadow of y is the variable x such that 
y G Vars(x[fd])- For T a set of literals (which we will later identified with a term or a restriction) let its 
shadow be V(T) = \J a£T V(a) and for D a set of terms we define its shadow as V(Z?) = \] TeD V(T). 

The following technical lemma, which will be proven later on, is the analog of Claim l5~TT1 accounting 
for the needed modifications which were discussed in the beginning of this subsection. The claim in this 
lemma is also the central point in our proof of Lemma IB31 We now state the lemma and promptly use it to 
prove Lemma l5~5l 

Lemma 5.13. Suppose B = {D±, . . . , D m } is a k-DNF set over Vars{F[f ( j\) and G is a CNF formula 
over Vars(F) such that B minimally implies the substituted formula G' = G[fd\- Suppose furthermore that 
S C Vars(F) and \S\ = k - r for r > 0. Then, letting D-(S) = {T G A | V(T) 5 S} denote the set of 
terms in Di whose shadow contains S, we have \V(Di(S))\ < k ■ (4 fcrf • /c|B|) r .. 

Proof of Lemma \53\ Let B = {Di, . . . , D m } and G' = f\ceproj* F (B) C[fd\- Notice that by Definition 15.31 
G' is of the form G 1 = G[fd] for some CNF formula G over Vars(F) so G' conforms to the assumptions of 
Lemma l5.13l 

First we argue that we may assume without loss of generality that B minimally implies G' . If this is not 
the case, there must exist a term T appearing in Di G B and a proper subterm T' ^ T such that replacing 
T by T' and calling the replaced /c-DNF set by W, we still have W \= G'. In this case set W = B and repeat 
the process. Notice that repeating the process does not increase the size of B (in fact, the size can shrink 
if some /c-DNF formula includes an empty term). Since each repetition of this process strictly shrinks the 
number of literals in B (counted with repetitions), we see it must terminate. Upon termination the remaining 
/c-DNF set, denoted B, which is of size at most m, minimally implies G' . 

Our next observation is that for every variable x appearing in G there must exist a literal a belonging to 
it that appears in B. To see this, argue by way of contradiction. Let C = C' V x be a clause appearing in G 
and assume for simplicity that x is a positive literal (the case of a negative literal is identical). Conditions 
(14. lab and (14. lbb of Definition 15 . 3 l imply that there exists an assignment a to Vars(F[fd]) such that a(B) = 
a(x[fd]) = 1 but a(C'[fd\) = 0. By construction, B 1= B so a(B) = 1 as well. By assumption, no variable 
belonging to x appears in B, so by changing the value of a on Vars(x[fd\) as to falsify x[fd] we reach an 
assignment that satisfies B but falsifies G[fd], contradiction. To simplify notation from here on we assume 
without loss of generality that B minimally implies G' and note for the record that 

VarSp(G) = \ Vars(G)\ < |V(B)| . (5.19) 

Next, for D G B we bound |V(L>)| using Lemma l5.13l and get 

|V(D)| = |V(£>(0))| < k • (4 kd • k\B'\) k < k ■ (A kd ■ k\B\) k . (5.20) 

Summing over all D G B gives 

|V(B)| < ^ l v 0°)l < |B| • /c • (A kd • fc|B|)* = 4 fc2rf • (k\B\) k+1 (5.21) 
Deo 

and this, together with (I5.19t . establishes Lemma 1531 □ 
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Hence, all that remains now to complete the proof of Theorem 15. H is to show Lemma B. 131 We end this 
section by presenting a proof of this final technical lemma. 

Proof of Lemma UH] By induction on r > 0. For the base case of r = we have |5| = k. Since A is a 
fe-DNF formula then any term T for which V(T) => S must have V(T) = S. Thus, \V(Di(S))\ = k and 
the inequality claimed in the lemma holds. 

For the inductive case of r > 0, let S denote the set of literals that belong to S, and let terms(5) 
denote the set of terms over S. We bound the number of terms by |terms(5)| = 2 2 ' Vars ( s )\ < 4 dk because 
each term is a set of literals coming from a set of literals of size 2| Vars(S)\. Partition the terms in Di(S) 
according to their intersection with S. Formally, for every s G terms(S) let 

Di(s) = {T G Di(S) | T n S = s} . (5.22) 

We have partitioned Di(S) into A kd partitions so to prove the claim in the lemma it is sufficient to show for 
each partition that 

|V(A(s))| <km(k- (4 M fcm) r_1 ) . (5.23) 

Consider one term s G terms(5). If V(A(s)) = S then clearly (I5.23t holds so we assume V(A(s)) ^ S 1 . 
In this case there exists T G Di such that V(T) 2 5 1 which implies the existence of a literal a G T\ S. Let 
p be a restriction satisfying the properties of Lemma B. 101 with respect to a, T, W and G'. 

Proposition 5.14. Every term T' appearing in Di(s) must include a literal a G" S whose shadow belongs to 
the shadow of p as well. Formally, (V(T') \ S) n (V(p) \ S) + 0. 

Proof of Proposition 15. 741 By way of contradiction. Assume T" falsifies the proposition. By assumption T' 
has the same set of literals as T within S and the third property of p listed in Lemma B . 1 Ol implies p satisfies 
all literals of T' inside S. Assuming that the intersection in the statement of the proposition is empty, we 
can extend p to a restriction p' that satisfies T' by setting at most k "new" variables on top of those set by 
p. The crucial observation is that none of the "new" variables set by p' have their shadow in V(/j). More 
to the point, suppose Xj is a "new" variable whose value is set by pf but is not set by p. Let x denote the 
shadow of Xj and let x = {x%, . . . , Xd} be the set of variables whose shadow is x. Our crucial observation, 
restated in different words, is that p does not set the value of any variable x\, . . . ,Xk- This is where the 
A;-non-authoritarianism of fd comes into play, because it implies that p' cannot fix the value of fd(%) by just 
setting a single variable Xj. But this means that we can extend p' so that fd(x) will obtain any truth value 
we find fit. We conclude that the fourth property listed in Lemma IB . 1 01 holds for p' as well as for p. This 
property implies that p' can be extended to an assignment a' such that G'(a') = 0. So a' is an assignment 
that satisfies W but falsifies G' . We have reached a contradiction, and the proposition follows. □ 

We continue with the proof of the inequality (I5.23t . The second property of Lemma B. 101 implies that 
|V(p)| < km. Thus, Proposition 15. 141 shows that there exists a set V s C Vars(F) \ S of size at most km 
such that 

V(A0)) C [J V(Di(S U {«})) . (5.24) 

V&Vs 

Since v G" S we have \S U {v}\ = k — (r — 1) so we may apply the inductive hypothesis of the inequality 
in Lemma B.13l to S U {v} which gives 

|V(A(«))| < Yl \ y ( D i( S u {«}))! <km(k- (A kd ■ km) r ~ 1 ^ . (5.25) 

We have shown that the inequality (15.23b holds for all s G terms(S'). Summing over all terms, there are at 
most 4 kd of them. The lemma follows. □ 
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6 Reductions Between /, -DNF Resolution and Pebbling 

It is not hard to see how a black pebbling V of a DAG G can be used to construct a resolution refutation 
of the pebbling contradiction Peb G in Definition 13. 12l in length and space upper-bounded by time(V) and 
space(V), respectively. It is straightforward to show that this translation from pebblings to refutations 
works even if we do an /^-substitution in the pebbling contradiction. We present a proof of this fact in 
SectionIO 

Using our new results in Section we can prove the more surprising fact that there is also a fairly tight 
reduction in the other direction: provided that the function fd is non-authoritarian, any resolution refutation 
of Peb G [fd] translates into a black- white pebbling of G with the same time-space properties (adjusting for 
constant factors depending on the function fd and the maximal indegree of G). This new reduction is given 
in Section l6~2l 

In Section 16.31 corresponding versions of both theorems are given for fc-DNF resolution. Finally, in 
Section 16.41 we put everything together to prove a meta-theorem saying that for DAGs G having the right 
time-space trade-off properties, we can prove that pebbling contradictions defined over such DAGs inherit 
(roughly) the same trade-off properties. This will allow us to use pebbling time-space trade-offs to obtain 
a wealth of strong trade-offs for both formula space and total space for standard resolution and in 
Section^] 

6.1 From Black Pebblings to Resolution Refutations 

In Lemma l23l we recalled that given any black-only pebbling V of a DAG G, we can mimic this pebbling 
in a resolution refutation of Peb G by deriving that a literal v is true whenever the corresponding vertex in G 
is pebbled. This construction carries over also to substitution formulas Peb G [fd] and we have the following 
theorem. 

Theorem 6.1. Let fd be a non-constant Boolean function of arity d and let G be a DAG with indegree at 
most £ and unique sink. Then given any complete black pebbling V of G, we can construct a resolution 
refutation it : Peb G [f(j\ h such that 

L(ir) < time(V) ■ exp(0(d(^ + 1))) , 
W(tt) < d{£ + 1) , and 
TotSp(ir) < space{V) ■ exp(0(d{£ + 1))) . 

Proof. Follows directly from Lemma 1231 and Theorem l2.ll □ 

We note that in our applications we will have the function arity d and the DAG indegree £ fixed (for 
standard resolution, we can pick d = £ = 2), which means that the bounds on length and space above turns 
into L(ir) = 0(time(V)) and TotSp(ir) = 0(space(V)). We also remark that for concrete functions fd, 
such as for instance XOR over two variables, we can easily compute explicit upper bounds on the constants 
hidden in the asymptotic notation if we so wish, and these constants are small. 

6.2 From Resolution Refutations to Black-White Pebblings 

In the other direction, i.e., from resolution refutations to pebbling strategies, the current paper establishes 
the following correspondence for resolution (which is a slightly more general version of the statement made 
in Theorem l2.4l) . 

Theorem 6.2. Let f be any non-authoritarian Boolean function and G be any DAG with unique sink and 
bounded indegree £. Then from any resolution refutation n : Peb G [f] h we can extract a black-white 
pebbling strategy V^for G such that time{V v ) <(£ + !)■ L(n) and space(V w ) < Sp(ir). 
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Before proving this theorem, we want to highlight the fact that Theorems 16.11 and 16.21 are far from 
being perfect converses of one another. This is so since the reduction in one direction uses black pebbling 
(Theorem l6.U while the reduction in the other direction is in terms of black-white pebbling (Theorem l6.2l i. It 
was shown in [ KS9 1 1 that there can be a quadratic gap in pebbling price depending on whether white pebbles 
may be used or not. At first sight this might not look to bad, since [Mey81 1 proved that the gap can never 
be worse than quadratic. However, the translation given in [Mey81 1 of a black- white pebbling in space s to 
a black-only pebbling in space 0(s 2 ) incurs an exponential blow-up in pebbling time, destroying all hope 
of obtaining nontrivial tradeoff results in this way. Hence, to get meaningful tradeoffs from Theorems 16.11 
and !6.2l we need a graph family where either the trade-off properties with respect to black- white and black- 
only pebbling are closely related, or where we can translate not only black but also black-white pebbling 
strategies into resolution refutations in a way that preserves both time and space. 

In this paper, we will use both of these approaches to get tight trade-off results. Some pebbling results 
in the literature, notably several results in the monumental paper [LT82|, have been shown to hold asymp- 
totically for both black-white and black-only pebbling, and we also appeal to INorlObl to get some new 
such graphs. Also, in certain limited settings it turns out to be possible to simulate black-white pebblings 
in resolution, and for such graphs we can again get tight proof complexity trade-offs drawing on structural 
results in [NorlOb]. These results hold only for a particular type of black-white pebblings, however, and it 
remains an interesting open problems whether or not black-white pebblings can be simulated efficiently in 
resolution in general. 

Proof of Theorem W2\ The proof is in three steps: 

1. First, we convert it : Peb G [f] h to a refutation ir' of Peb G such that VarSp(ir') < Sp(ir) and the 
number of axiom downloads in n' is upper-bounded by the number of axiom downloads in n. This is 
Theorem |2~T1 

2. The refutation ir' : Peb G h can contain weakening moves, which we do not want, so we appeal 
to Proposition l3.6l to get a refutation ir" : Peb G \~0 without any weakening steps. By Lemma 13.101 
without loss of generality we can assume that ir" is frugal (Definition 13.91 . This part of the proof 
just uses standard techniques, and the number of axiom downloads and the variable space can only 
decrease when going from n' to ir" . 

3. Finally, we show that ir" corresponds to a black- white pebbling strategy V for G such that time(V) is 
upper-bounded by the number of axiom downloads and space(V) is upper-bounded by the maximal 
number of variables occurring simultaneously in tt". This final part was essentially proven by the first 
author in [Ben09 1, but since we need a more detailed result than can be read off from that paper, we 
present the full construction below for completeness. 

Putting together these three steps, Theorem 16 . 21 clearly follows. □ 

What remains is thus to show the following, slightly more detailed version of a lemma in [Ben09 1, which 
serves to establish part[3]in the proof above. 

Lemma 6.3 (|Ben09|). Let G be any DAG with unique sink and bounded indegree i, and suppose that 
7r is any resolution refutation of Peb G without weakening that is also frugal. Then there is a black-white 
pebbling strategy for G such that spaCBiV^) < VarSp(w) and tifineiV^) is at most (I + 1) times the 
number of axiom downloads in n. 

Proof. Given a refutation n = {Co = 0, Ci, . . . , C r = {0}} of Peb G , we translate every clause set Q into 
a black-white pebble configuration Ft = (B t , Wt) using a slightly modified version of the ideas in [Ben09|, 
and then show that V = {Po 5 . .. ,P T } is essentially a legal black- white pebbling of G as in the statement 
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of the lemma. The translation will satisfy the invariant that B t U Wt = Vars(Ct) which yields the upper 
bound on pebbling space in terms of variable space. The first configuration Co = is thus translated into 

Po = (0,0). 

Suppose inductively that (Bt—i, Wt-i) has been constructed from Ct—i and consider all the variables 
x G Vars(Ct) one by one. If x G Vars(Ct) H B t -\, keep x in B t . Otherwise, if x G Lit(Ct) appears as a 
positive literal, add x to B t . Otherwise, if x G Lit(Ct), add x to W^. This is our translation of Q into black 
pebbles S t and white pebbles To see that this translation yields a legal pebbling, consider the derivation 
rule applied to get from Q-i to Q. 

Axiom download Suppose that we download the pebbling axiom or source axiom for a vertex v with imme- 
diate predecessors ui, . . . , ug> (where we have £' = for a source v). All predecessors m not having 
pebbles on them at time t — 1 get white pebbles. Then u gets a black pebble, if it did not already 
have one. Note that this is a legal pebble placement since all immediate predecessors of v (if any) 
have pebbles at this point. We remark that to black-pebble v, we might have to remove a white pebble 
from v first, but since all immediate predecessors have pebbles on them this poses no problems. Also, 
downloading the sink axiom places a white pebble on the sink z if this vertex is empty, which is a 
legal pebbling move. By the bound on the indegree, this step involves placing at most I + 1 pebbles. 

Inference In this case Vars(Ct-i) = Vars(Q), so nothing happens. 

Erasure Suppose that the clause erased in C. Just apply the translation function. Suppose that this results 
in a pebble on x disappearing. Then we have x G Vars(C) but x ^ Vars(Ct). Before being erased, 
C has been resolved with some other clause (recall that tt is frugal). But as long as we did not resolve 
over the variable x, we will still have x G Vars(Q), and hence C must have been resolved over x at 
some time t' < t. At this time x appeared both positively and negatively in C t > , and in view of how 
we defined the translation from clauses to pebbles, this means that the vertex x has contained a black 
pebble in the interval [t',t — 1]. Thus the pebble disappearing at time t is black, and black pebbles 
can always be removed freely. 

To conclude the proof, note that during the course of the refutation all axioms must have been down- 
loaded at least once, since Peb G is easily seen to be minimally unsatisfiable. In particular, this means that 
the sink z is black-pebbled at some time during the proof, and we can decide to keep the black pebble on 
z from that moment onwards. (This potentially adds one pebble extra to the pebbling space, but this is fine 
since the inequality in Theorem l2.1l is strict so there is margin for this.) 

Since every time an axiom is downloaded it must also be erased at some later time, we get the time 
bound of (£ + 1) times the number of axiom downloads (and in fact it is easy to see that this bound can be 
improved by taking into account the inference steps, when nothing happens in the pebbling). The lemma 
follows. □ 

6.3 Reductions Between Pebbling and /,-DNF Resolution 

For fc-DNF resolution, we get the following, slightly different, reductions. The next theorem is particularly 
interesting when we have d = K + 1, which we can get, for instance, by picking fd{x\ , . . . , Xd) = ©iLi^i- 

Theorem 6.4. Let K be any fixed positive integer. Suppose that : {0, l} d i— > {0, 1} is any K-non- 
authoritarian Boolean function and that G is any DAG with unique sink of size Q(n) and bounded vertex 
indegree i. Then the following holds. 

I. The formula Pcb G \ff\ is refutable in syntactic fR(d) in length O(n) and formula space 0(1). 
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2. For any k < K, from any semantic %\{k) -refutation tt of Peb G [fd] we can extract a black-white 
pebbling 7\ of G such that time^-n) = 0(L(tt)) and space^^) = 0(Sp{-K) k+l ). 

Any constants hidden in the asymptotic notation depends only on K, d, and I. 

Proof. Part |2 of the theorem follows just as in the proof of Theorem 16.21 but appealing to Theorem 15.11 
for 9\(k) instead of Theorem 12. II for standard resolution. For part^ we use the result in [Ben09| that any 
formula Peb G can be refuted in linear length and constant clause space simultaneously, and then appeal to 
Corollary ES] □ 



6.4 Obtaining Resolution Trade-offs from Pebbling 

Combining the theorems above, we can now prove that if we can find DAGs G with appropriate pebbling 
trade-off properties, such DAGs immediately yield trade-off results in resolution. And as we will see in 
Section^ there are (explicitly constructible) DAGs with the needed properties. 

In order not to clutter the statement of the next theorem, we assume that the indegree £ of the DAGs, the 
arity d of the Boolean functions /, and the maximal size of terms K in the /c-DNF resolution proof systems 
that we consider are all fixed, so that any dependence on K, d and I can be hidden in the asymptotical 
notation. (This is not much of a restriction since we will always be able to choose I = 2, and also we will 
have d = K + 1 in the applications that we care about.) 

Theorem 6.5 (9^(/c)-trade-offs from pebbling). Let K, d, and i, be arbitrary but fixed positive integers 
such that K < d, and let f : {0, l} d i— > {0, 1} be some K -non-authoritarian function. Suppose that G is a 
DAG with n vertices, unique sink, and bounded indegree I, and that g,h : N + i— > N + are functions satisfying 
the following properties: 

• For every s > Peb{G) there is a black pebbling V of G with space(V) < s and time(V) < g(s). 

• For every s > BW-Peb(G) and every black-white pebbling V of G with space(V) < s it holds that 
time{V) > h(s). 

Then the following holds for the substitution pebbling formula Peb G [f]: 

1. Peb G [f] is a CNF formula of size 0(n) and width 0(1). 

2. Peb G [f] is refutable in standard syntactic resolution in length L^(Peb G [f] h 0) = O(n) and width 
W(Peb G [f] h 0) = 0(1) simultaneously, and is also refutable in syntactic resolution in total space 
TotSp< H (Peb G [f]\-0) = O(Peb(G))/ 

3. Peb G [f] is refutable in syntactic 9\(d) in length L<^^(Peb G [f] h 0) = O(n) and formula space 
Sp^^(Peb G [f] h 0) = 0(1) simultaneously. 

4. For every s > Peb(G) there is a standard syntactic resolution refutation ir s : Peb G [f] h in length 
L(tt s ) = 0(g(s)) and total space TotSp(ir s ) = O(s). 

5. The clause space of any semantic resolution refutation is lower-bounded by Sp<y^(Peb G [f] h 0) > 
BW-Peb(G), and for every s > BW-Peb(G) and every semantic refutation ir s : Peb G [f]^Q in 
clause space Sp(ir s ) < s, it holds that L(tt s ) = £l(h(s)). 

6. For every s > BW-Peb(G) and every k < K, the formula space of any semantic 9\(k)- refutation is 
lower-bounded by Sp<^^(Peb G [f] h 0) = f!( k+ -{/ BW-Peb(G)) , and any semantic %\(h) -refutation 
tt s : Peb G [f] h in formula space Sp(ir s ) = o( k+ \fs), it holds that L(ir s ) = Cl(h(s)). 
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All hidden constants in the asymptotical notation depend only on K, d, and £, and are independent ofG. 

Proof. Item Q on the list is an easy consequence of Definition 13.141 The rest is just a matter of putting 
together all the theorems proven in this section. Hence, items |2] and |4]both follow from Theorem l6.1l (to get 
item|2l consider the trivial pebbling that black-pebbles all vertices of G in topological order). Theorem l6.2l 
yields item|5] Finally, items |3] and |6] follow from Theorem l6.4l □ 

This theorem will be of particular interest when we can find graph families {G n }^ ( L 1 with Peb(G n ) = 
@(BW-Peb(G n )) having trade-off functions g n (s) = &(h n (s)). For such families of DAGs, Theorem 16. 51 
yields asymptotically tight trade-offs in standard resolution. As we can see, these trade-offs hold for both 
clause space and total space simultaneously with respect to length, since the upper bounds are in terms of 
total space and the lower bounds in terms of clause space. 

For fc-DNF resolution, k > 2, the trade-offs obtained by using Theorem 16.51 are not tight, since there is 
a gap of a (k + l)st root between the upper and lower bounds on space. We do not know what the "correct" 
result for JH(fc) is, and a priori it could well be the case that qualitatively the same trade-offs should hold for 
9\(k) as for standard resolution. Such a strengthening of Theorem l6.5l provablv cannot be obtained by our 
methods, however. In view of the recent results in [NR09|, our techniques will always leave at least a kth 
root gap, so in order to make any more substantial improvements to Theorem l6.5l it seems that fundamentally 
new ideas would be needed. 

7 Separation and Trade-off Results for / -DNF Resolution 

We have finally reached the point where we can state and prove our time-space trade-off results for standard 
resolution and SH(fe). Given all the work done so far, the proofs are all variations of the following pattern: 
pick some suitable graph family, make the appropriate choices of parameters, consider the corresponding 
pebbling contradiction CNF formulas, do /-substitution for some non-authoritarian function /, and ap- 
ply Theorem 16.51 (which we obtained with the help of the "substitution space theorems" in Theorems 12.11 
andl5~TT). 

7.1 More Detailed Discussion of Previous Trade-off Results for Resolution 

For completeness, we start this section by a slightly more detailed discussion than in Section ^ of previous 
work in this area. The question of length-space trade-offs in resolution was first studied by the first author 
in IBen()9l and more recently by the second author in [Nor09b ]. (A proceedings version work of Hertel and 
Pitassi [HP07 1 claimed a trade-off result that was simplified and improved by [Nor09b ]. The journal version 
IIHP1 0I retracted this claim due to an error in the proof.) Let us describe these works and how they compare 
to our results. 

The paper [Ben09| contains a number of results for general resolution. For instance, it shows a strong 
trade-off between clause space and width, establishing that there are formulas refutable in constant clause 
space as well as constant width, but where for any particular refutation 7r it holds that Sp(ir) ■ W(ir) = 
0(n/logn). However, the lengths-space trade-offs in [Ben09| are limited to the very restricted case of 
tree-like resolution, and do not extend to general resolution. 

In contrast, [Nor09b|, studies general, unrestricted resolution. The results therein apply to resolution 
refutation of bounded-width formulas, but again every theorem is restricted to dealing with a particular 
space measure. An unsatisfying aspect of all results in [Nor09b ] is that they use quite artificial constructions 
of formulas "glued together" from two different unsatisfiable subformulas over disjoint variable sets. In 
particular, these constructions are non-explicit. Moreover, in [Nor09b] the length-space trade-off results 
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apply only for a very carefully selected ratio of space to formula size, and display an abrupt decay of proof 
length when space is increased even by small amounts. 

In contrast, all trade-off results presented in the current paper have the following properties: 

• They apply to general, unrestricted resolution, and even extend to fc-DNF resolution. 

• They are stated for explicitly constructible formulas that have bounded width and are (minimally) 
unsatisnable. 

• They apply simultaneously to both total space and clause space (formula space for A;-DNF resolution), 
since the upper bounds are in terms of total space and the lower bounds in terms of clause/formula 
space. Moreover, recalling Definition l3.5l on page[nj we stress that all our upper bounds are in terms 
of syntactic proof systems (i.e., the usual ones) whereas the lower bounds in the trade-offs hold even 
for the much stronger semantic versions of the proof systems. 

• Finally, the trade-offs are very robust in the sense that they are not sensitive to small perturbations of 
either length or space. 

In the rest of this section, we state and prove our collection of separation and trade-off results. 

7.2 Space Hiearchy for /, -DNF Resolution 

We gave a sketch of the proof of the space hierarchy for fc-DNF resolution already in Section 1231 and the 
missing details in this sketch are easily filled in from the material in Section|6] Let us nevertheless write out 
the details here for the convenience of the reader. 

Theorem 12.51 (restated). For every k > 1 there exists an explicitly constructible family of(3(k + 1))-CNF 
formulas {-Fnl^Li °f s ^ze 0(n) such that 

1. there are 9\(k + 1) -refutations ix n : F n h in simultaneous length L(ir n ) = O(n) and formula space 
Sp(ir n ) = 0(1), but 

2. any *K(fc) -refutation of F n requires formula space Q( fc+ \/ n / log n) . 
The constants hidden by the asymptotic notation depend only on k. 

The theorem is established with the help of the following graph family. 

Lemma 7.1 ( IGT78I ). There is an explicitly constructible family ofDAGs {G„}^ =1 of size 0(n) having a 
unique sink and vertex indegree 2 such that BW-Peb(G n ) = 0(n/ log n). 

Proof of Theorem l2~5l Take the pebbling formulas defined in terms of the graphs in Lemma |7~T1 substitute 
a fc-non-authoritarian Boolean function / of arity k + 1, say XOR over k + 1 variables for concreteness, 
and appeal to Theorem l6.5l The size and width of the formula follows from part^(and Definition 13.1 41) . the 
upper bound on formula space is part|3j and the lower bound follows from part|6]of Theorem 1631 □ 

We note that for standard resolution, it is possible to get rid of the square root in the separation and 
prove that any resolution refutation of F n requires formula space 0(n/logn) (as was done in [BN08|). 
This follows by appealing to part|5]of Theorem 16 . 5 1 instead. 
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7.3 Trade-offs for Constant Space 

Shifting focus to our length-space trade-offs, which occupy the bulk of this section, our first result is that 
trade-offs can occur even for formulas refutable in constant space. What is more, there are such formulas for 
which we can prove not only a trade-off threshold, but (in the case of standard resolution) even specify the 
whole trade-off curve. We establish this by studying the following family of graphs (referred to in [ LT82 ] 
as bit reversal graphs). 

Lemma 7.2 (|LT82|). There are explicitly constructible DAGs G n of size 0(n) with a single sink and vertex 
indegree 2 having the following pebbling properties: 

1. The black pebbling price ofG n is Peb(G n ) = 3. 

2. Any black pebbling strategy V n for G n that optimizes time given space constraint^ O(n) exhibits a 
trade-off time(T> n ) = 0(n 2 / ' space(P n )). 

3. Any black-white pebbling strategy V n for G n that optimizes time given space constraints O (jjn) 
exhibits a trade-off time(V n ) = ®({n/ space(P n )) 2 ). 

What we would like to do now is to plug this theorem right into Theorem l6.5l and be done. Unfortunately, 
although this will prove to be a simple and successful strategy in general, it will not work for this particular 
family of formulas if we want to get tight trade-offs. This is so since there is a quadratic gap between 
the black-white and black-only pebbling trade-offs (see the discussion immediately after Theorem I6.2t . 
However, analyzing the strcture of the proof of Lemma l7~2l a little bit closer, it turns out that in this particular 
case it is possible to simulate optimal black-white pebblings in resolution in a time- and space-preserving 
way. (This is a special case of a more general theorem proven in INorlObl . and we refer to that paper for 
the details.) 

Lemma 7.3 ( INorlObl ). Let F n be pebbling substitution formulas over the graphs G n in Lemma U^ Then 
for any s = O^y/nj there are syntactic standard resolution refutations in length 0((n/s) 2 ) and clause 
space O(s). 

Using both Lemma l7~2l and Lemma 1731 we can establish the results stated next, which are tight for 
resolution. 

Theorem 7.4 (More detailed version of Theorem l2.6> . For any fixed positive integer K there are explicitly 
constructible CNF formulas {F n }^ =1 of size 0(n) and width 3(K + 1) such that the following holds (where 
all multiplicative constants hidden in the asymptotic notation depend only on K): 

1. The formulas F n are refutable in syntactic resolution in total space TotSp^(F n h 0) = 0(1). 

2. For any s(n) = 0(\/n) there are syntactic resolution refutations n n of F n in simultaneous length 
L^n) = 0((n/s(n)) 2 ) and total space TotSp{ir n ) = 0(s(n)). 

3. There are syntactic 9\(K + 1) -refutations ir n of F n in simultaneous length L(7r n ) = 0(n) and for- 
mula space Sp(ir n ) = 0(1). 

4. For any semantic resolution refutation ir n : F n \~0 in clause space Sp(7r n ) < s(n) it holds that 
L(ir n ) = n{(n/s(n)) 2 ). 

'The reason for including the upper bounds on space in the statement of the theorem is that no matter how much space is 
available, it is of course never possible to do better than linear time. Thus the trade-offs cannot hold when length dips below linear. 
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5. For any k < K, any semantic d\{k)- refutation ir n : F n \~0 in formula space Sp(TT n ) < s(n) must 
have length L(ir n ) = 17 ^(n/(s(n) 1 ^ fc+1 ^)) 2 ^. In particular, any constant-space 9i(k) -refutation 
must also have quadratic length. 

^From now on and for the rest of this section, we will assume unless stated otherwise that d = K + 1 
and that / = fd(x%, ■ ■ ■ , xj) = ©f =1 x« is the exclusive or function over d variables (although we will 
redundantly repeat this from time to time for increased clarity). 

Proof of Theorem U~4\ Consider the pebbling formulas Peb G [f\ defined over the bit reversal DAGs in 
Lemma 17.21 Combining Lemma 17.21 with Theorem 16.51 and improving the upper bounds by appealing 
to Lemma 1731 the theorem follows. 

Since this is our first trade-off proof, let us write it out in detail. Thus, the upper bound on total space 
in part [2 of Theorem 17 . 41 follows from the pebbling space upper bound in part [2 of Lemma 1731 combined 
with the reduction from pebbling to resolution in part|2]of Theorem l6.5l As noted above, the upper bound 
on the length-space trade-off in Part 13 of Theorem 17 . 41 does not follow from Theorem 16.51 but is obtained 
by applying Lemma 1731 The constant upper bound on 9\(K + l)-formula space in part |3] is part |3] of 
Theorem 16.5 1 The lower bound on the length-space trade-off for resolution in part|4]follows from the black- 
white pebbling trade-off in part |3] of Lemma 1731 combined with the reduction from resolution to pebbling 
in part[5]of Theorem 16.51 Finally, the weaker bound on <R( k) -tradeoffs in part|5]of Theorem 17.41 is what 
follows when we apply the weaker reduction from fc-DNF resolution to pebbling in part |6] of Theorem 16.51 
instead. □ 



7.4 Superpolynomial Trade-offs for any Non-constant Space 

It is clear that we can never get superpolynomial trade-offs from DAGs pebblable in constant space, since 
such graphs must have constant-space pebbling strategies in polynomial time by a simple counting argu- 
ment. However, perhaps somewhat surprisingly, as soon as we study arbitrarily slowly growing space, we 
can obtain superpolynomial trade-offs for formulas whose refutation space grows this slowly. This is a 
consequence of the following recent pebbling trade-off result from I N or 1 Obi , extending a construction by 
Carlson and Savage ICS80IICS82I . 

Lemma 7.5 (INo rlObl ). There is an explicitly constructible graph family T(c,r), for c,r € N + , with a 
unique sink and vertex indegree 2, having the following properties: 

1. The graphs T(c, r) are of size \V(T(c, r)) | = 0(cr 3 + c 3 r 2 ). 

2. T(c, r) has black-white pebbling price BW-Peb(T(c,r)) = r + 0(1) and black pebbling price 
Peb(T(c,r)) = 2r + 0(1)0 

3. There is a black-only pebbling ofT(c, r) in time linear in the graph size and in space 0(c + r). 

4. Suppose that V is a black-white pebbling ofT(c, r) with space(V) < r + sfor < s < c/8. Then 
the time required to perform V is lower-bounded by 

The graph family in Lemma l731 turns out to be surprisingly versatile and will occur several times below 
with different parameter settings. We now use it to prove the following theorem 

10 Note that item|2|says that the pebbling price grows linearly with r but is independent of c. Thus, the parameter s in item|4]can 
be thought of as the extra pebbling space "slack" governing how severe the time-space trade-off will be. 
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Theorem 7.6 (More detailed version of Theorem 12.8b . Let g[n) be any arbitrarily slowly growing mono- 
tone function w(l) = g(n) = 0(n 1//7 ), and fix any e > and positive integer K. Then there are explicitly 
constructible CNF formulas {F n }^ =1 of size 0(n) and width 3(K + 1) such that the following holds: 

1. The formulas F n are refutable in syntactic resolution in total space TotSp^(F n h 0) = 0(g(n)). 

2. There are syntactic resolution refutations 7r n of F n in simultaneous length L(ir n ) = O(n) and total 
space TotSp(ir n ) = O ^(n/g 2 (n)) 1 ^ 3 ^. 

3. There are syntactic %\(K + 1) -refutations ir n of F n in simultaneous length L(-7r n ) = O(n) and for- 
mula space Sp(ir n ) = 0(1). 

4. Any semantic resolution refutation of F n in clause space O ^(n/g^n)) 1 ^ 3 must have superpoly- 
nomial length. 

5. For any k < K, any semantic d\{k)-refutation in formula space O ^(ra/g^n)) 1 ^ 3 ^ 1 ^ must 
have superpolynomial length. 

All multiplicative constants hidden in the asymptotic notation depend only on K, e and g. 

We remark that the upper-bound condition g(n) = 0(n 1 / 7 ) is very mild and is there only for technical 
reasons in this theorem. If we allow the minimal space to grow as fast as n e for some e > 0, then there are 
other pebbling trade-off results that can give even stronger results for resolution than the one stated above, 
(see, in particular, Section l7~6l l. Thus, the interesting part is that g(n) is allowed to grow arbitrarily slowly. 

Proof of Theorem \7^6\ Consider the graphs T(c, r) in Lemma FOI We want to choose the parameters c and 
r in a suitable way so that we get a family of graphs in size n = (cr 3 + c 3 r 2 ) . If we set 



for g(n) = 0(n 1 / 7 }, this forces 



r = r(n) = g(n) (7.1) 



c(n) = e(</n/ 5 2(„)) . (7.2) 



Consider the graph family {G n }^ =1 defined by G n = T(c(n),r(n)) as in ( 17.11 ) and d7.2t . which is a family 
of size 6(n). Consider the pebbling formulas F n = Peb G [/], substitute a /c-non-authoritarian Boolean 
function / of arity k + 1, say XOR over k + 1 variables for concreteness, and appeal to the translation 
between pebbling and resolution in Theorem l6.5l 

Part El of Lemma [73] yields that TotSp^(F n h 0) = 0(g(n)). Also, the black pebbling of G n in part|3] 
yields a linear-time refutation ir n : F n h with TotSp (ir n ) = O ( \/n/g 2 (n)) . 

Now set the parameter s in part|4]of Lemma l7~5l to s = c 1 ^' for e' = ke where n is chosen large enough 
(depending on K). Then for large enough n we have s < c/8 and the trade-off in part@]applies. Combining 

f^^l 1 /3 

the pebbling trade-off there with Theorem 16.5 1 we get that if the clause space is less than (n/ g 2 {n)) e , 

then the required length of the resolution refutation grows as (ft(ce')) r = (n(n/g 2 (n))) e9{n) which is 
superpolynomial in n for any g{n) = w(l). The rest of the theorem follows in a similar fashion. In 
particular, to get Part|5]above use Part|6]of Theorem 16. 51 which gives a bound that is a (k + l)-root of the 
bound we get for semantic resolution. □ 
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7.5 Robust Superpolynomial Trade-offs 

We now know that there are polynomial trade-offs in resolution for constant space, and that going ever so 
slightly above constant space we can get superpolynomial trade-offs. The next question we want to focus on 
is how robust trade-offs we can get. That is, over how large a range of space does the trade-off hold? Given 
minimal refutation space s, how much larger space is needed in order to obtain the linear length refutation 
that we know exists for any pebbling contradiction? 

The answer is that we can get superpolynomial trade-offs that span almost the whole range between 
constant and linear space. We present two different results illustrating this. 

Theorem 7.7. For any fixed positive integer K, there are explicitly constructible CNF formulas {i 7 ^}^^ 
of size 0(n) and width 3(K + 1) such that: 

1. Every formula F n is refutable in syntactic resolution in total space TotSp^(F n h 0) = O(logn). 

2. There is a syntactic resolution refutation ix n : F n h in simultaneous length L(ir n ) = O(n) and total 
space TotSp(ir n ) = O f \J n l l°g 2 ■ 

3. There are syntactic 9\(K + 1) -refutations ir n of F n in simultaneous length L(i: n ) = O(n) and for- 
mula space Sp(ir n ) = 0(1). 

4. There is a constant k > such that any semantic resolution refutation ir n : F n h in clause space 
Sp(n n ) < K^Jn/ log 2 n must have length L(n n ) = n n ^ lo ^ n \ 

5. For any k < K, any semantic k-DNF resolution refutation ir n : F n h in formula space Sp(ir n ) = 
o ((n/log 2 n) x /(3(*+i))) must have length L(vr n ) = n n ( lo s lo s n ). 

The constant k as well as the constants hidden in the asymptotic notation are independent ofn. 

Proof. Consider the graphs T(c,r) in Lemma 1731 with parameters chosen so that c = 2 r . Then the size 
of r(c, r) is 0(r 2 2 3r ). Let r(n) = max{r : r 2 2 3r < n} and define the graph family {G n }^ ( L 1 by 
G n = T(2 r ,r) for r = r(n). Finally, consider the pebbling formulas F n = Peb@ [/] with the help of 
Theorem l6.5l 

Translating from G n back to T(c, r) we have parameters r = 0(log n) and c = 0((n/ log 2 n) 1 / 3 ), so 
Lemma 1731 yields that TotSp^(F n h 0) = O(logra). Also, the black pebbling of G n in Lemma 1731 yields 
a linear-time refutation 7r„ : F n h with TotSp{ir n ) = 0((n/ log 2 n) 1 / 3 ). 

Setting s = c/8 in the trade-off in part|4]of Lemma l731 shows that there is a constant k such that if the 
clause space of a refutation n n : F n h drops below k ■ (n/log 2 n) 1 / 3 < (r + 2) + s, then we must have 

L{n n ) > 0(l) r • r! = n n(loglogn) (7.3) 

(where we used that r = B(logn) for the final equality). The rest of the theorem follows in an analogous 
fashion. In particular, to get Part[5]above use Part[6]of Theorem l6 . 5 1 which gives a bound that is a ( k + l)-root 
of the bound we get for semantic resolution. 

□ 

Sacrificing a square at the lower end of the interval, we can improve the upper end to n/log n. 

Theorem 7.8 (More detailed version of Theorem I2.9> . For any positive integer K, there are explicitly 
constructible CNF formulas {F n }^ =1 of size 0(n) and width 3(K + 1) such that the following holds (where 
the hidden constants depend only on K): 
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1. The formulas F n are refutable in syntactic resolution in total space TotSpn(F n h 0) = 0(log 2 n). 

2. There are syntactic resolution refutations 7r n of F n in simultaneous length L(ir n ) = 0(n) and total 
space TotSp(jr n ) = 0(n/logn). 

3. There are syntactic %\{K + 1) -refutations tt ti of F n in simultaneous length L(n n ) = 0(n) and for- 
mula space Sp(ir n ) = 0(1). 

4. Any semantic resolution refutation of F n in clause space Sp(ir n ) = o(n/logn) must have length 

5. For any k < K, any semantic 9\(k) -refutation informula space Sp(ir n ) = o ((n/ log n) 1 ^ fc+1 - ) ) must 
have length L(vr n ) = n "(i°gl°g«). 

For this theorem we need the following graph family from [LT82|, which is built as stacks of supercon- 
centrators, i.e., by placing graphs with very good connectivity properties on top of one another. 

Lemma 7.9 (|LT82|). There is a family of explicitly constructible graphs <I>(m, r) with a unique sink and 
vertex indegree 2 such that the following holds: 

1. &(m, r) has size Q(rm). 

2. Pe£)(<3?(m, r)) = O(rlogm). 

3. There is a linear-time black pebbling strategy V for <l>(m, r) with space(V) = 0(m). 

4. IfV is a black-white pebbling strategy for $(m, r) in space s < m/20, then time(V) > m ■ (gf|) r - 

Proof of Theorem IZ8l As usual, pick / = fd{x\, . . . ,Xd) = ©f = i^i for d = K + 1, and consider the 
pebbling formulas Psb^ mr s[f] defined over stacks of superconcentrators $(m,r) as in Lemma I7~9l with 
m = 20T and r = [n/T\ for T = 0(n/logn). Theorem 17 . 8 1 no w follows by combining Lemma I7T91 with 
Theorem l6.5l □ 

We remark that the results in Theorem l7.8l are perhaps slightly stronger than those in Theorem 17 .7 1 since 
they span a much larger (although non-overlapping) space interval. However, they require a very much 
more involved graph construction with worse hidden constants than the very simple and clean construction 
underlying Theorem l7.7l 



7.6 Exponential Trade-offs 

Superpolynomial trade-offs are all fine and well, but can we get exponential trade-offs? We conclude this 
section by giving strong answers in the affirmative to this question. 

The same counting argument that was mentioned in the beginning of Section l7~4l tells us that we can 
never expect to get exponential trade-offs from DAGs with polylogarithmic pebbling price. However, if we 
move to graphs with pebbling price $7(n e ) for some constant e > 0, pebbling formulas over such graphs can 
exhibit exponential trade-offs. 

We obtain our first such exponential trade-off, which also exhibits a certain robustness, by again studying 
the DAGs in Lemmal7~5l 

Theorem 7.10. For any positive integer K set e = 3 ^ +8 and 5 = (K + 2)e. Then there is a constant 
< 7 < 1 (depending only on K) and explicitly constructible CNF formulas {Fn} 1 ^^ of size 0(ra) and 
width 3(K + 1) such that the following holds: 
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1. The formulas F n are refutable in syntactic resolution in total space TotSp<^{F n h 0) = 0(n e ). 

2. There are syntactic resolution refutations 7r n of F n in simultaneous length L(TT n ) = 0(n) and total 
space TotSp(ir n ) = O (n s ). 

3. There are syntactic 9\(K + 1) -refutations ix n of F n in simultaneous length L(ir n ) = 0(n) and for- 
mula space Sp(ir n ) = O (n S )). 

4. Any semantic resolution refutation of F n in clause space n s /10 must have length at least n e \. 

5. For any k < K, any semantic 9\(k) -refutation in formula space ^n 5 ^ fe+1 ^ must have exponential 
length, i.e., length at least rf\. 

All multiplicative constants hidden in the asymptotic notation depend only on K. 

Note that since 5 > {k + l)e the tradeoff we get for /c-DNF resolution is nontrivial. By this we mean 
that although there exist refutations requiring space 0(n e ), as long as we use space that is somewhat smaller 
than n <5// ( fc+1 ) the refutation length is exponential. 

Proof of Theorem IZ61 Consider again the graphs F(c, r) in Lemma l731 Set r = r(n) = n e and c = c(n) = 
n s . The graph family {G n }™ =l defined by G n = T(c(n).r(n)) as in (17.11) and (I7.2t . is a family of size Q(n). 
Consider the pebbling formulas F n = Pebg [f], where / is a fc-non-authoritarian Boolean function / of 
arity k + 1, say XOR over k + 1 variables for concreteness, and appeal to the translation between pebbling 
and resolution in Theorem l6.5l 

PartEJof Lemma 1731 yields that TotSp^(F n h 0) = 0(n t ). Also, the black pebbling of G n in part|3] 
yields a linear-time refutation 7r n : F n h with TotSp(ir n ) = 0(n s ). 

Now if s < n s /10 inspection of part|4]of Lemma l731 shows that for large enough n we get an exponential 
lower bound on the pebbling time of n e l. Combining this with Theorem 16. 51 gives our time lower bound for 
semantic resolution. Finally, to get the lower bound for semantic A>DNF resolution we apply Part |6] of 
Theorem 16. 5 1 which gives a bound that is a (k + l)-root of the bound we get for semantic resolution. □ 

We remark that there is nothing magic in our particular choice of parameters in the proof of Theo- 
rem Other parameters could be plugged in instead and yield slightly different results. 

Now that we know that there are robust exponential trade-offs for resolution and fc-DNF resolution, 
we want to obtain exponential trade-offs for formulas with their minimal refutation space being as large as 
possible. 

The higher the lower bound on space is, the more interesting the trade-off gets. It seems reasonable that 
to look at and analyze a CNF formula, a SAT solver will at some point use at least linear space. If so, it 
is not immediate to argue why the SAT solver would later work hard on optimizing lower order terms in 
the memory consumption and thus get stuck in a trade-off for relatively small space. Ideally, therefore, we 
would like to obtain trade-offs for linear or even superlinear space (if there are such trade-offs, that is). For 
such formulas, we would be more confident that the trade-off phenomena should also show up in practiced 

It is clear that pebbling contradictions can never yield any trade-off results in the superlinear regime, 
since they are always refutable in linear length and linear space simultaneously. Also, all trade-offs obtain- 
able from the graphs in Lemma 17.51 will be for space far below linear. However, using the following two 



"Having said that, we also want to point out that the case can certainly be made that even sublinear space trade-offs might be very 
relevant for real life applications. Intriguingly enough, pebbling contradictions over so-called pyramid graphs (|Coo74 Kla85|) 
might in fact be an example of this. We know that these formulas have short, simple refutations, but in | SBK04 1 it was shown that 
state-of-the-art clause learning algorithms can have serious problems with even moderately large pebbling contradictions. (Their 
"grid pebbling formulas" are exactly our pebbling contradictions using substitution with binary, non-exclusive or.) We wonder 
whether the high lower bound on clause space can be part of the explanation behind this phenomenon. 
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results we can get exponential trade-offs for space almost linear, or more precisely for space as large as 
0(n/ log re). 

Lemma 7.11 (| LT82 1). For every directed acyclic graph G with n vertices and hounded indegree t, and for 
every space parameter s satisfying (3£ + 2)n/logre < s < re, there is a black pebbling strategy V for G 
with space(V) < s and time(V) < s • 2 2 ° (n/s) . 

Lemma 7.12 (|LT82|). There exist constants e, k > such that for all sufficiently large integers n,s 
satisfying nn / log n < s < re, we can find an explicitly constructible single-sink DAG G with indegree 2 
and number of vertices at most n such that any black-white pebbling strategy V for G with space{V) < s 
must have time^) > s ■ 2 2m/a . 

Note that the graph G in Lemma 17. 121 depends on the pebbling space parameter s. Lengauer and 
Tarjan conjecture that no single graph gives an exponential time-space tradeoff for the whole range of 
s G [re/ log re, re], but to the best of our knowledge this problem is still open. 

Theorem 12.101 (restated). Let k be any sufficiently large constant. Then there are k- CNF formulas F n of 
size O(re) and a constant k' <C k such that: 

1. The formulas F n have syntactic resolution refutations in total space k' ■ re/ log re. 

2. F n is also refutable in syntactic resolution in length O(re) and total space O(re) simultaneously. 

3. However, any semantic refutation of F n in clause space at most k ■ re/ log re has length exp(re^( 1 )). 

Proof. Appeal to Lemmas l7.11l and l7.12l in combination with Theorem 16. 51 in the same way as in previous 
proofs in this section. □ 

We remark that Lemma 17 .121 in combination with Lemma 17.1 II can be used to obtain DAGs (and thus 
CNF formulas) with other superpolynomial trade-offs as well for different space parameters in the range 
above re/ log re up to n/ log log re. For simplicity and conciseness, however, we only state the special case 
above. 

As was discussed in Section l2~3l Theorem 12. 101 does not yield any provably nontrivial trade-offs for 
£H(fc), since the space range where the trade-off kicks in is so low that we do not know whether there 
actually exist any 9t(fc)-refutation in such small space. We do get weaker, though exponential (and provably 
non-vacuous) trade-offs for £H(fc) in Theorem 17.101 

8 Concluding Remarks 

We end this paper by discussing some open questions related to our reported work. 

Resolution For the length, width, and clause space measures in resolution, there are known upper and 
lower worst-case bounds that essentially match modulo constant factors. This is not the case for total space, 
however. 

Open Question 1. Are there polynomial-size CNF formulas of width 0(1) which require total resolution 
refutation space TotSp^(F h 0) = Q((size o/F) 2 ) ? 

The answer has been conjectured by [ABRW02] to be "yes", but as far as we are aware, there are no 
stronger lower bounds on total space known than those that follow trivially from corresponding linear lower 
bounds on clause space. Thus, a first step would be to show superlinear lower bounds on total space. 
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One way of interpreting the results of the current paper is that time-space trade-offs in pebble games 
carry over more or less directly to the resolution proof system (modulo the technical restrictions discussed 
in Section The resolution trade-off results obtainable by this method are inherently limited, however, 
in the sense that pebblings in small space can be seen never to take too much time by a simple counting 
argument. For resolution there are no such limitations, at least not a priori, since the corresponding counting 
argument does not apply. Thus, one can ask whether it is possible to demonstrate even more dramatic 
time-space trade-offs for resolution than those obtained via pebbling. 

To be more specific, we are particularly interested in what trade-offs are possible at the extremal points 
of the space interval, where we can only get polynomial trade-offs for constant space and no trade-offs at all 
for linear space. 

Open Question 2. Are there superpolynomial trade-offs for formulas refutable in constant space? 

Open Question 3. Are there formulas with trade-offs in the range space > formula size? Or can every 
refutation be carried out in at most linear space? 

We find Open Question |3] especially intriguing. Note that all bounds on clause space proven so far, 
inlcuding the trade-offs in the current paper, are in the regime where the space is less than formula size 
(which is quite natural, since by [ET01 1 we know the size of the formula is an upper bound on the minimal 
clause space needed). It is unclear to what extent such lower bounds on space are relevant to state-of-the-art 
SAT solvers, however, since such algorithms will presumably use at least a linear amount of memory to 
store the formula to begin with. For this reason, it seems to be a highly interesting problem to determine 
what can be said if we allow extra clause space above linear. Are there formulas exhibiting trade-offs in 
this superlinear regime, or is it always possible to carry out a minimal-length refutation in, say, at most a 
constant factor times the linear upper bound on the space required for any formula? As was noted above, 
pebbling formulas cannot help answer these two questions, since they are always refutable in linear time and 
linear space simultaneously by construction, and since constant pebbling space implies polynomial pebbling 
time. 

A final problem related specifically to standard resolution is that it would be interesting to investigate 
the implications of our results for applied satisfiability algorithms. 

Open Question 4. Do the trade-off phenomena we have established in this paper show up "in real life" for 
state-of-the-art DPLL based SAT-solvers, when run on the appropriate pebbling contradictions (or varia- 
tions of such pebbling contradictions)? 

A Stronger space separation for /c-DNF resolution We have proven a strict separation between 
/c-DNF resolution and (/c+l)-DNF resolution by exhibiting for every fixed k a family of CNF formulas of 
size n that require space SI ( fc+ y / nj log n) for any /c-DNF resolution refutation but can be refuted in constant 
space in (fc+l)-DNF resolution. This shows that the family of D\(k) proof systems form a strict hierarchy 
with respect to space. 

As has been said above, however, we have no reason to believe that the lower bound for 9i(k) is tight. In 
fact, it seems reasonable that a tighter analysis should be able to improve the bound to at least Q ( \Jnj log n) 
and possibly even further. The only known upper bound on the space needed in *R(k) for these formulas is 
the 0(n/ log n) bound that is easily obtained for standard resolution. Closing, or at least narrowing, the gap 
between 0( fe+ y / n / log n) and 0(n/ log n) is hence an open question. 

Understanding minimally unsatisfiable /c-DNF sets It seems that the problem of getting better 
lower bounds on space for /c-DNF resolution is related to the problem of better understanding the structure 
of minimally unsatisfiable sets of /c-DNF formulas. Although the correspondence is more intuitive than 
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formal, it would seem that progress on this latter problem would probably translate into sharper lower 
bounds for 9\(k) as well. The reason for this hope is that the asymptotically optimal results for standard 
resolution in this paper can in some sense be seen to follow from (the proof technique used to obtain) the 
tight bound for CNF formulas in Theorem l2.13l 

What we are able to prove in this paper is that any minimally unsatisfiable A;-DNF set D (for k a fixed 
constant) must have at least f2 ( fe+ -y/|D[) variables (Theorem I2.14t but the only explicit constructions of 
such sets that we where able to obtain had 0(|0|) variables (Lemma [2.151 . As has already been mention, 
the recent work [NR09| unexpectedly improved the lower bound to roughly 0( ^/|D|j. This appears to be 
a natural and interesting combinatorial problem in its own right, and it would be very nice to close the gap 
between the upper and lower bound. 

We have the following conjecture, where for simplicity we fix k to remove it from the asymptotic 
notation. 

Conjecture 5. Suppose that D is a minimally unsatisfiable k-DNF set for some arbitrary but fixed positive 
integer k. Then the number of variables in D is at most 0(|D|) fc . 

Proving this conjecture would establish asymptotically tight bounds for minimally unsatisfiable fc-DNF 
sets (ignoring factors involving the constant k). 

Generalizations to Other proof systems We have presented a "substitution space theorem" for res- 
olution as a way of lifting lower bounds on the number of variables to lower bounds on (clause) space, and 
have then extended this result by lifting lower bounds on the number of variables in resolution to lower 
bounds on formula space in the much stronger k-DNF resolution proof systems. It is a natural question to 
ask whether our techniques can be extended to other proof systems as well. 

We remark that the translations in Sections |4] and |5]of refutations of substitution formulas in some other 
proof system V via projection to resolution refutations of the original formula seem extremely generic and 
robust in that they do not at all depend on which derivation rules are used by V nor on the class of formulas 
with which V operates. The only place where the particulars of the proof system come into play is when we 
actually need to analyze the content of the proof blackboard. As described in the introduction, this happens 
at some critical point in time when we know that the blackboard of our translated (projected) resolution 
proof mentions a lot of variables, and want to argue that this implies that the blackboard of the V -proof must 
contain a lot of formulas (or possibly some other resource that we want to lower-bound in V). This part of 
the analysis is the (essentially tight) result for resolution in Lemma 14171 and the (likely not tight) bound for 
fc-DNF sets in Lemma 1531 in this paper. Any corresponding result for some other proof system V would 
translate into lower bounds for V in terms of lower bounds on variable space in resolution. 
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